2025-05-07T19:42:32.7122508Z Current runner version: '2.323.0' 2025-05-07T19:42:32.7128791Z Runner name: 'i-05f9a7b1831803d00' 2025-05-07T19:42:32.7129753Z Machine name: 'ip-10-0-68-91' 2025-05-07T19:42:32.7132812Z ##[group]GITHUB_TOKEN Permissions 2025-05-07T19:42:32.7135154Z Contents: read 2025-05-07T19:42:32.7135713Z Metadata: read 2025-05-07T19:42:32.7136285Z Packages: read 2025-05-07T19:42:32.7136887Z ##[endgroup] 2025-05-07T19:42:32.7139623Z Secret source: None 2025-05-07T19:42:32.7140666Z Prepare workflow directory 2025-05-07T19:42:32.7785987Z Prepare all required actions 2025-05-07T19:42:32.7822937Z Getting action download info 2025-05-07T19:42:32.9782503Z Download action repository 'actions/checkout@v4' (SHA:11bd71901bbe5b1630ceea73d27597364c9af683) 2025-05-07T19:42:33.2537056Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-05-07T19:42:33.7958380Z Complete job name: build_artifact (x86, linux.24xlarge, default, 3.10, 11.8.0, clang) 2025-05-07T19:42:33.8826573Z A job started hook has been configured by the self-hosted runner administrator 2025-05-07T19:42:33.8956863Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2025-05-07T19:42:33.8967884Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:33.8969259Z ##[endgroup] 2025-05-07T19:42:34.9975245Z Runner Type: linux.24xlarge 2025-05-07T19:42:34.9975772Z Instance Type: c5.24xlarge 2025-05-07T19:42:34.9976088Z AMI Name: unknown 2025-05-07T19:42:35.0009829Z AMI ID: ami-071226ecf16aa7d96 2025-05-07T19:42:40.0842315Z ##[group]Checking docker version 2025-05-07T19:42:40.0855397Z ##[command]/usr/bin/docker version --format '{{.Server.APIVersion}}' 2025-05-07T19:42:40.1069856Z '1.44' 2025-05-07T19:42:40.1085912Z Docker daemon API version: '1.44' 2025-05-07T19:42:40.1086467Z ##[command]/usr/bin/docker version --format '{{.Client.APIVersion}}' 2025-05-07T19:42:40.1288907Z '1.44' 2025-05-07T19:42:40.1303892Z Docker client API version: '1.44' 2025-05-07T19:42:40.1311082Z ##[endgroup] 2025-05-07T19:42:40.1315066Z ##[group]Clean up resources from previous jobs 2025-05-07T19:42:40.1321053Z ##[command]/usr/bin/docker ps --all --quiet --no-trunc --filter "label=ab9abe" 2025-05-07T19:42:40.1495897Z ##[command]/usr/bin/docker network prune --force --filter "label=ab9abe" 2025-05-07T19:42:40.1644797Z ##[endgroup] 2025-05-07T19:42:40.1645181Z ##[group]Create local container network 2025-05-07T19:42:40.1654066Z ##[command]/usr/bin/docker network create --label ab9abe github_network_23c588a7f7ea4423950955615706ca02 2025-05-07T19:42:40.3854501Z 56803a4ab21fa5d6c3c94e039640a4e85913162028d811281dd27212e781415d 2025-05-07T19:42:40.3871686Z ##[endgroup] 2025-05-07T19:42:40.3896709Z ##[group]Starting job container 2025-05-07T19:42:40.3916429Z ##[command]/usr/bin/docker pull amazonlinux:2023 2025-05-07T19:42:40.5251232Z 2023: Pulling from library/amazonlinux 2025-05-07T19:42:40.5356925Z Digest: sha256:cb5b4c509d62ae388f674c139ae5e8281fc160c217d474445e912043e1941988 2025-05-07T19:42:40.5358665Z Status: Image is up to date for amazonlinux:2023 2025-05-07T19:42:40.5380564Z docker.io/library/amazonlinux:2023 2025-05-07T19:42:40.5462441Z ##[command]/usr/bin/docker create --name e02b70d7a92a414ba22decd9f7117672_amazonlinux2023_04e192 --label ab9abe --workdir /__w/FBGEMM/FBGEMM --network github_network_23c588a7f7ea4423950955615706ca02 --user root -e "HOME=/github/home" -e GITHUB_ACTIONS=true -e CI=true -v "/var/run/docker.sock":"/var/run/docker.sock" -v "/home/ec2-user/actions-runner/_work":"/__w" -v "/home/ec2-user/actions-runner/externals":"/__e":ro -v "/home/ec2-user/actions-runner/_work/_temp":"/__w/_temp" -v "/home/ec2-user/actions-runner/_work/_actions":"/__w/_actions" -v "/home/ec2-user/actions-runner/_work/_tool":"/__w/_tool" -v "/home/ec2-user/actions-runner/_work/_temp/_github_home":"/github/home" -v "/home/ec2-user/actions-runner/_work/_temp/_github_workflow":"/github/workflow" --entrypoint "tail" amazonlinux:2023 "-f" "/dev/null" 2025-05-07T19:42:40.5877479Z 9d6bffcb154a3aafcf2591fe5a342c22735765e9fd7602d6273e9fda2f8bbbaa 2025-05-07T19:42:40.5901402Z ##[command]/usr/bin/docker start 9d6bffcb154a3aafcf2591fe5a342c22735765e9fd7602d6273e9fda2f8bbbaa 2025-05-07T19:42:41.0294109Z 9d6bffcb154a3aafcf2591fe5a342c22735765e9fd7602d6273e9fda2f8bbbaa 2025-05-07T19:42:41.0317582Z ##[command]/usr/bin/docker ps --all --filter id=9d6bffcb154a3aafcf2591fe5a342c22735765e9fd7602d6273e9fda2f8bbbaa --filter status=running --no-trunc --format "{{.ID}} {{.Status}}" 2025-05-07T19:42:41.0461437Z 9d6bffcb154a3aafcf2591fe5a342c22735765e9fd7602d6273e9fda2f8bbbaa Up Less than a second 2025-05-07T19:42:41.0484986Z ##[command]/usr/bin/docker inspect --format "{{range .Config.Env}}{{println .}}{{end}}" 9d6bffcb154a3aafcf2591fe5a342c22735765e9fd7602d6273e9fda2f8bbbaa 2025-05-07T19:42:41.0649353Z HOME=/github/home 2025-05-07T19:42:41.0650413Z GITHUB_ACTIONS=true 2025-05-07T19:42:41.0651595Z CI=true 2025-05-07T19:42:41.0652765Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:42:41.0674474Z ##[endgroup] 2025-05-07T19:42:41.0683754Z ##[group]Waiting for all services to be ready 2025-05-07T19:42:41.0685848Z ##[endgroup] 2025-05-07T19:42:41.0763092Z ##[group]Run yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:41.0763978Z yum update -y; yum install -y binutils findutils git pciutils sudo tar wget which 2025-05-07T19:42:41.0764902Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:41.0765485Z env: 2025-05-07T19:42:41.0765811Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:41.0766286Z BUILD_ENV: build_binary 2025-05-07T19:42:41.0766567Z BUILD_TARGET: default 2025-05-07T19:42:41.0766947Z BUILD_VARIANT: cuda 2025-05-07T19:42:41.0767233Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:42:41.0767557Z ##[endgroup] 2025-05-07T19:42:41.7970270Z Amazon Linux 2023 repository 89 MB/s | 37 MB 00:00 2025-05-07T19:42:48.4066469Z Last metadata expiration check: 0:00:07 ago on Wed May 7 19:42:41 2025. 2025-05-07T19:42:48.9712660Z Dependencies resolved. 2025-05-07T19:42:48.9888572Z Nothing to do. 2025-05-07T19:42:48.9889142Z Complete! 2025-05-07T19:42:49.2199711Z Last metadata expiration check: 0:00:08 ago on Wed May 7 19:42:41 2025. 2025-05-07T19:42:49.2834302Z Dependencies resolved. 2025-05-07T19:42:49.3061659Z ======================================================================================== 2025-05-07T19:42:49.3062571Z Package Arch Version Repository Size 2025-05-07T19:42:49.3063264Z ======================================================================================== 2025-05-07T19:42:49.3063687Z Installing: 2025-05-07T19:42:49.3064153Z binutils x86_64 2.41-50.amzn2023.0.3 amazonlinux 5.3 M 2025-05-07T19:42:49.3064771Z findutils x86_64 1:4.8.0-2.amzn2023.0.2 amazonlinux 539 k 2025-05-07T19:42:49.3065377Z git x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 54 k 2025-05-07T19:42:49.3066062Z pciutils x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 93 k 2025-05-07T19:42:49.3066652Z sudo x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 1.3 M 2025-05-07T19:42:49.3067233Z tar x86_64 2:1.34-1.amzn2023.0.4 amazonlinux 879 k 2025-05-07T19:42:49.3067739Z wget x86_64 1.21.3-1.amzn2023.0.4 amazonlinux 779 k 2025-05-07T19:42:49.3068380Z which x86_64 2.21-26.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:49.3068890Z Installing dependencies: 2025-05-07T19:42:49.3069315Z cracklib x86_64 2.9.6-27.amzn2023.0.2 amazonlinux 82 k 2025-05-07T19:42:49.3070049Z cyrus-sasl-lib x86_64 2.1.27-18.amzn2023.0.3 amazonlinux 786 k 2025-05-07T19:42:49.3070736Z elfutils-debuginfod-client x86_64 0.188-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:49.3071420Z git-core x86_64 2.47.1-1.amzn2023.0.2 amazonlinux 4.7 M 2025-05-07T19:42:49.3072388Z git-core-doc noarch 2.47.1-1.amzn2023.0.2 amazonlinux 2.8 M 2025-05-07T19:42:49.3073089Z gnutls x86_64 3.8.3-6.amzn2023.0.1 amazonlinux 1.1 M 2025-05-07T19:42:49.3073697Z groff-base x86_64 1.22.4-7.amzn2023.0.2 amazonlinux 1.0 M 2025-05-07T19:42:49.3074317Z gzip x86_64 1.12-1.amzn2023.0.1 amazonlinux 160 k 2025-05-07T19:42:49.3074935Z hwdata noarch 0.384-1.amzn2023.0.3 amazonlinux 1.6 M 2025-05-07T19:42:49.3075574Z jansson x86_64 2.14-0.amzn2023 amazonlinux 46 k 2025-05-07T19:42:49.3076229Z kmod-libs x86_64 29-2.amzn2023.0.5 amazonlinux 62 k 2025-05-07T19:42:49.3076836Z less x86_64 608-2.amzn2023.0.2 amazonlinux 168 k 2025-05-07T19:42:49.3077560Z libcbor x86_64 0.7.0-3.amzn2023.0.2 amazonlinux 57 k 2025-05-07T19:42:49.3078256Z libdb x86_64 5.3.28-49.amzn2023.0.2 amazonlinux 756 k 2025-05-07T19:42:49.3078874Z libeconf x86_64 0.4.0-1.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:49.3079450Z libedit x86_64 3.1-38.20210714cvs.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:49.3080095Z libfdisk x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 153 k 2025-05-07T19:42:49.3080674Z libfido2 x86_64 1.10.0-2.amzn2023.0.2 amazonlinux 95 k 2025-05-07T19:42:49.3081385Z libmetalink x86_64 0.1.3-14.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:49.3082101Z libpwquality x86_64 1.4.4-6.amzn2023.0.2 amazonlinux 106 k 2025-05-07T19:42:49.3082711Z libsemanage x86_64 3.4-5.amzn2023.0.2 amazonlinux 121 k 2025-05-07T19:42:49.3083365Z libutempter x86_64 1.2.1-4.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:49.3083945Z nano x86_64 8.3-1.amzn2023 amazonlinux 706 k 2025-05-07T19:42:49.3084571Z ncurses x86_64 6.2-4.20200222.amzn2023.0.6 amazonlinux 394 k 2025-05-07T19:42:49.3085271Z nettle x86_64 3.10.1-1.amzn2023.0.1 amazonlinux 573 k 2025-05-07T19:42:49.3085997Z openldap x86_64 2.4.57-6.amzn2023.0.7 amazonlinux 256 k 2025-05-07T19:42:49.3086851Z openssh x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 454 k 2025-05-07T19:42:49.3087510Z openssh-clients x86_64 8.7p1-8.amzn2023.0.14 amazonlinux 708 k 2025-05-07T19:42:49.3187847Z pam x86_64 1.5.1-8.amzn2023.0.4 amazonlinux 542 k 2025-05-07T19:42:49.3188506Z pciutils-libs x86_64 3.7.0-3.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:49.3189127Z perl-AutoLoader noarch 5.74-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:49.3189685Z perl-B x86_64 1.80-477.amzn2023.0.6 amazonlinux 179 k 2025-05-07T19:42:49.3190283Z perl-Carp noarch 1.50-458.amzn2023.0.2 amazonlinux 29 k 2025-05-07T19:42:49.3190996Z perl-Class-Struct noarch 0.66-477.amzn2023.0.6 amazonlinux 22 k 2025-05-07T19:42:49.3191643Z perl-Data-Dumper x86_64 2.174-460.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:49.3192236Z perl-Digest noarch 1.20-1.amzn2023.0.2 amazonlinux 26 k 2025-05-07T19:42:49.3192896Z perl-Digest-MD5 x86_64 2.58-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:49.3193496Z perl-DynaLoader x86_64 1.47-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:49.3194284Z perl-Encode x86_64 4:3.15-462.amzn2023.0.2 amazonlinux 1.7 M 2025-05-07T19:42:49.3194832Z perl-Errno x86_64 1.30-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:49.3195394Z perl-Error noarch 1:0.17029-5.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:49.3195971Z perl-Exporter noarch 5.74-459.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:49.3196546Z perl-Fcntl x86_64 1.13-477.amzn2023.0.6 amazonlinux 21 k 2025-05-07T19:42:49.3197132Z perl-File-Basename noarch 2.85-477.amzn2023.0.6 amazonlinux 18 k 2025-05-07T19:42:49.3197748Z perl-File-Find noarch 1.37-477.amzn2023.0.6 amazonlinux 26 k 2025-05-07T19:42:49.3198334Z perl-File-Path noarch 2.18-2.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:49.3198935Z perl-File-Temp noarch 1:0.231.100-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:49.3199644Z perl-File-stat noarch 1.09-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:49.3200246Z perl-FileHandle noarch 2.03-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:49.3200868Z perl-Getopt-Long noarch 1:2.52-2.amzn2023.0.2 amazonlinux 60 k 2025-05-07T19:42:49.3201492Z perl-Getopt-Std noarch 1.12-477.amzn2023.0.6 amazonlinux 16 k 2025-05-07T19:42:49.3202136Z perl-Git noarch 2.47.1-1.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:49.3202722Z perl-HTTP-Tiny noarch 0.078-1.amzn2023.0.3 amazonlinux 56 k 2025-05-07T19:42:49.3203271Z perl-IO x86_64 1.43-477.amzn2023.0.6 amazonlinux 87 k 2025-05-07T19:42:49.3203837Z perl-IPC-Open3 noarch 1.21-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:49.3204428Z perl-MIME-Base64 x86_64 3.16-2.amzn2023.0.2 amazonlinux 31 k 2025-05-07T19:42:49.3205033Z perl-Net-SSLeay x86_64 1.94-1.amzn2023.0.1 amazonlinux 392 k 2025-05-07T19:42:49.3205599Z perl-POSIX x86_64 1.94-477.amzn2023.0.6 amazonlinux 97 k 2025-05-07T19:42:49.3206181Z perl-PathTools x86_64 3.78-459.amzn2023.0.2 amazonlinux 85 k 2025-05-07T19:42:49.3206793Z perl-Pod-Escapes noarch 1:1.07-458.amzn2023.0.2 amazonlinux 20 k 2025-05-07T19:42:49.3207402Z perl-Pod-Perldoc noarch 3.28.01-459.amzn2023.0.3 amazonlinux 84 k 2025-05-07T19:42:49.3208024Z perl-Pod-Simple noarch 1:3.42-2.amzn2023.0.2 amazonlinux 215 k 2025-05-07T19:42:49.3208618Z perl-Pod-Usage noarch 4:2.01-2.amzn2023.0.2 amazonlinux 41 k 2025-05-07T19:42:49.3209254Z perl-Scalar-List-Utils x86_64 4:1.56-459.amzn2023.0.2 amazonlinux 71 k 2025-05-07T19:42:49.3209902Z perl-SelectSaver noarch 1.02-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:49.3210480Z perl-Socket x86_64 4:2.032-1.amzn2023.0.2 amazonlinux 55 k 2025-05-07T19:42:49.3211050Z perl-Storable x86_64 1:3.21-458.amzn2023.0.2 amazonlinux 96 k 2025-05-07T19:42:49.3211612Z perl-Symbol noarch 1.08-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:49.3212228Z perl-Term-ANSIColor noarch 5.01-459.amzn2023.0.2 amazonlinux 48 k 2025-05-07T19:42:49.3212844Z perl-Term-Cap noarch 1.17-458.amzn2023.0.2 amazonlinux 22 k 2025-05-07T19:42:49.3213430Z perl-TermReadKey x86_64 2.38-9.amzn2023.0.2 amazonlinux 36 k 2025-05-07T19:42:49.3214066Z perl-Text-ParseWords noarch 3.30-458.amzn2023.0.2 amazonlinux 17 k 2025-05-07T19:42:49.3214712Z perl-Text-Tabs+Wrap noarch 2021.0726-1.amzn2023.0.1 amazonlinux 22 k 2025-05-07T19:42:49.3215406Z perl-Time-Local noarch 2:1.300-5.amzn2023.0.2 amazonlinux 34 k 2025-05-07T19:42:49.3215977Z perl-URI noarch 5.09-1.amzn2023.0.2 amazonlinux 108 k 2025-05-07T19:42:49.3216540Z perl-base noarch 2.27-477.amzn2023.0.6 amazonlinux 17 k 2025-05-07T19:42:49.3217123Z perl-constant noarch 1.33-459.amzn2023.0.2 amazonlinux 23 k 2025-05-07T19:42:49.3217680Z perl-if noarch 0.60.800-477.amzn2023.0.6 amazonlinux 14 k 2025-05-07T19:42:49.3218255Z perl-interpreter x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 71 k 2025-05-07T19:42:49.3218809Z perl-lib x86_64 0.65-477.amzn2023.0.6 amazonlinux 15 k 2025-05-07T19:42:49.3219366Z perl-libnet noarch 3.13-2.amzn2023.0.2 amazonlinux 126 k 2025-05-07T19:42:49.3219918Z perl-libs x86_64 4:5.32.1-477.amzn2023.0.6 amazonlinux 2.0 M 2025-05-07T19:42:49.3220518Z perl-mro x86_64 1.23-477.amzn2023.0.6 amazonlinux 29 k 2025-05-07T19:42:49.3221081Z perl-overload noarch 1.31-477.amzn2023.0.6 amazonlinux 46 k 2025-05-07T19:42:49.3221688Z perl-overloading noarch 0.02-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:49.3222274Z perl-parent noarch 1:0.238-458.amzn2023.0.2 amazonlinux 14 k 2025-05-07T19:42:49.3222868Z perl-podlators noarch 1:4.14-458.amzn2023.0.2 amazonlinux 112 k 2025-05-07T19:42:49.3223436Z perl-subs noarch 1.03-477.amzn2023.0.6 amazonlinux 12 k 2025-05-07T19:42:49.3223993Z perl-vars noarch 1.05-477.amzn2023.0.6 amazonlinux 13 k 2025-05-07T19:42:49.3224549Z shadow-utils x86_64 2:4.9-12.amzn2023.0.4 amazonlinux 1.1 M 2025-05-07T19:42:49.3225102Z systemd-libs x86_64 252.23-3.amzn2023 amazonlinux 613 k 2025-05-07T19:42:49.3225757Z util-linux x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 2.2 M 2025-05-07T19:42:49.3226287Z util-linux-core x86_64 2.37.4-1.amzn2023.0.4 amazonlinux 432 k 2025-05-07T19:42:49.3226734Z Installing weak dependencies: 2025-05-07T19:42:49.3227178Z nano-default-editor noarch 8.3-1.amzn2023 amazonlinux 10 k 2025-05-07T19:42:49.3227785Z perl-IO-Socket-IP noarch 0.41-3.amzn2023.0.2 amazonlinux 42 k 2025-05-07T19:42:49.3228388Z perl-IO-Socket-SSL noarch 2.075-1.amzn2023.0.2 amazonlinux 218 k 2025-05-07T19:42:49.3228975Z perl-Mozilla-CA noarch 20200520-4.amzn2023.0.2 amazonlinux 13 k 2025-05-07T19:42:49.3229544Z perl-NDBM_File x86_64 1.15-477.amzn2023.0.6 amazonlinux 23 k 2025-05-07T19:42:49.3230106Z sudo-python-plugin x86_64 1.9.15-1.p5.amzn2023.0.1 amazonlinux 56 k 2025-05-07T19:42:49.3230473Z 2025-05-07T19:42:49.3230571Z Transaction Summary 2025-05-07T19:42:49.3230864Z ======================================================================================== 2025-05-07T19:42:49.3231189Z Install 107 Packages 2025-05-07T19:42:49.3231657Z 2025-05-07T19:42:49.3231822Z Total download size: 38 M 2025-05-07T19:42:49.3232137Z Installed size: 151 M 2025-05-07T19:42:49.3232399Z Downloading Packages: 2025-05-07T19:42:49.6145878Z (1/107): cracklib-2.9.6-27.amzn2023.0.2.x86_64. 3.9 MB/s | 82 kB 00:00 2025-05-07T19:42:49.6242859Z (2/107): elfutils-debuginfod-client-0.188-3.amz 6.1 MB/s | 41 kB 00:00 2025-05-07T19:42:49.6323383Z (3/107): cyrus-sasl-lib-2.1.27-18.amzn2023.0.3. 20 MB/s | 786 kB 00:00 2025-05-07T19:42:49.6546732Z (4/107): binutils-2.41-50.amzn2023.0.3.x86_64.r 87 MB/s | 5.3 MB 00:00 2025-05-07T19:42:49.6594961Z (5/107): findutils-4.8.0-2.amzn2023.0.2.x86_64. 17 MB/s | 539 kB 00:00 2025-05-07T19:42:49.6604353Z (6/107): git-2.47.1-1.amzn2023.0.2.x86_64.rpm 1.9 MB/s | 54 kB 00:00 2025-05-07T19:42:49.6813619Z (7/107): gnutls-3.8.3-6.amzn2023.0.1.x86_64.rpm 55 MB/s | 1.1 MB 00:00 2025-05-07T19:42:49.7119089Z (8/107): git-core-2.47.1-1.amzn2023.0.2.x86_64. 84 MB/s | 4.7 MB 00:00 2025-05-07T19:42:49.7251617Z (9/107): git-core-doc-2.47.1-1.amzn2023.0.2.noa 44 MB/s | 2.8 MB 00:00 2025-05-07T19:42:49.7328984Z (10/107): groff-base-1.22.4-7.amzn2023.0.2.x86_ 23 MB/s | 1.0 MB 00:00 2025-05-07T19:42:49.7362405Z (11/107): gzip-1.12-1.amzn2023.0.1.x86_64.rpm 16 MB/s | 160 kB 00:00 2025-05-07T19:42:49.7457992Z (12/107): hwdata-0.384-1.amzn2023.0.3.noarch.rp 86 MB/s | 1.6 MB 00:00 2025-05-07T19:42:49.7476265Z (13/107): jansson-2.14-0.amzn2023.x86_64.rpm 3.3 MB/s | 46 kB 00:00 2025-05-07T19:42:49.7496563Z (14/107): kmod-libs-29-2.amzn2023.0.5.x86_64.rp 5.3 MB/s | 62 kB 00:00 2025-05-07T19:42:49.7553420Z (15/107): less-608-2.amzn2023.0.2.x86_64.rpm 18 MB/s | 168 kB 00:00 2025-05-07T19:42:49.7590109Z (16/107): libcbor-0.7.0-3.amzn2023.0.2.x86_64.r 6.4 MB/s | 57 kB 00:00 2025-05-07T19:42:49.7643683Z (17/107): libdb-5.3.28-49.amzn2023.0.2.x86_64.r 52 MB/s | 756 kB 00:00 2025-05-07T19:42:49.7660923Z (18/107): libeconf-0.4.0-1.amzn2023.0.3.x86_64. 2.6 MB/s | 28 kB 00:00 2025-05-07T19:42:49.7682387Z (19/107): libedit-3.1-38.20210714cvs.amzn2023.0 13 MB/s | 108 kB 00:00 2025-05-07T19:42:49.7719614Z (20/107): libfdisk-2.37.4-1.amzn2023.0.4.x86_64 21 MB/s | 153 kB 00:00 2025-05-07T19:42:49.7749108Z (21/107): libfido2-1.10.0-2.amzn2023.0.2.x86_64 11 MB/s | 95 kB 00:00 2025-05-07T19:42:49.7764030Z (22/107): libmetalink-0.1.3-14.amzn2023.0.2.x86 3.9 MB/s | 31 kB 00:00 2025-05-07T19:42:49.7793431Z (23/107): libpwquality-1.4.4-6.amzn2023.0.2.x86 16 MB/s | 106 kB 00:00 2025-05-07T19:42:49.7820587Z (24/107): libsemanage-3.4-5.amzn2023.0.2.x86_64 18 MB/s | 121 kB 00:00 2025-05-07T19:42:49.7831683Z (25/107): libutempter-1.2.1-4.amzn2023.0.2.x86_ 4.1 MB/s | 26 kB 00:00 2025-05-07T19:42:49.7899834Z (26/107): nano-8.3-1.amzn2023.x86_64.rpm 68 MB/s | 706 kB 00:00 2025-05-07T19:42:49.7916023Z (27/107): nano-default-editor-8.3-1.amzn2023.no 1.1 MB/s | 10 kB 00:00 2025-05-07T19:42:49.7949946Z (28/107): ncurses-6.2-4.20200222.amzn2023.0.6.x 34 MB/s | 394 kB 00:00 2025-05-07T19:42:49.8022859Z (29/107): nettle-3.10.1-1.amzn2023.0.1.x86_64.r 49 MB/s | 573 kB 00:00 2025-05-07T19:42:49.8065094Z (30/107): openldap-2.4.57-6.amzn2023.0.7.x86_64 23 MB/s | 256 kB 00:00 2025-05-07T19:42:49.8101520Z (31/107): openssh-8.7p1-8.amzn2023.0.14.x86_64. 30 MB/s | 454 kB 00:00 2025-05-07T19:42:49.8155481Z (32/107): openssh-clients-8.7p1-8.amzn2023.0.14 57 MB/s | 708 kB 00:00 2025-05-07T19:42:49.8197322Z (33/107): pciutils-3.7.0-3.amzn2023.0.2.x86_64. 11 MB/s | 93 kB 00:00 2025-05-07T19:42:49.8252905Z (34/107): pam-1.5.1-8.amzn2023.0.4.x86_64.rpm 39 MB/s | 542 kB 00:00 2025-05-07T19:42:49.8260337Z (35/107): pciutils-libs-3.7.0-3.amzn2023.0.2.x8 3.9 MB/s | 41 kB 00:00 2025-05-07T19:42:49.8278164Z (36/107): perl-AutoLoader-5.74-477.amzn2023.0.6 3.3 MB/s | 22 kB 00:00 2025-05-07T19:42:49.8337327Z (37/107): perl-B-1.80-477.amzn2023.0.6.x86_64.r 25 MB/s | 179 kB 00:00 2025-05-07T19:42:49.8357855Z (38/107): perl-Carp-1.50-458.amzn2023.0.2.noarc 3.2 MB/s | 29 kB 00:00 2025-05-07T19:42:49.8373048Z (39/107): perl-Class-Struct-0.66-477.amzn2023.0 2.5 MB/s | 22 kB 00:00 2025-05-07T19:42:49.8395141Z (40/107): perl-Data-Dumper-2.174-460.amzn2023.0 10 MB/s | 55 kB 00:00 2025-05-07T19:42:49.8422208Z (41/107): perl-Digest-1.20-1.amzn2023.0.2.noarc 5.7 MB/s | 26 kB 00:00 2025-05-07T19:42:49.8439576Z (42/107): perl-Digest-MD5-2.58-2.amzn2023.0.2.x 5.6 MB/s | 36 kB 00:00 2025-05-07T19:42:49.8462194Z (43/107): perl-DynaLoader-1.47-477.amzn2023.0.6 4.1 MB/s | 26 kB 00:00 2025-05-07T19:42:49.8598570Z (44/107): perl-Encode-3.15-462.amzn2023.0.2.x86 95 MB/s | 1.7 MB 00:00 2025-05-07T19:42:49.8613953Z (45/107): perl-Errno-1.30-477.amzn2023.0.6.x86_ 929 kB/s | 15 kB 00:00 2025-05-07T19:42:49.8633541Z (46/107): perl-Error-0.17029-5.amzn2023.0.2.noa 2.8 MB/s | 41 kB 00:00 2025-05-07T19:42:49.8672400Z (47/107): perl-Exporter-5.74-459.amzn2023.0.2.n 5.9 MB/s | 31 kB 00:00 2025-05-07T19:42:49.8682457Z (48/107): perl-Fcntl-1.13-477.amzn2023.0.6.x86_ 3.2 MB/s | 21 kB 00:00 2025-05-07T19:42:49.8708593Z (49/107): perl-File-Basename-2.85-477.amzn2023. 2.5 MB/s | 18 kB 00:00 2025-05-07T19:42:49.8732749Z (50/107): perl-File-Find-1.37-477.amzn2023.0.6. 5.7 MB/s | 26 kB 00:00 2025-05-07T19:42:49.8747757Z (51/107): perl-File-Path-2.18-2.amzn2023.0.2.no 6.1 MB/s | 36 kB 00:00 2025-05-07T19:42:49.8784821Z (52/107): perl-File-stat-1.09-477.amzn2023.0.6. 3.6 MB/s | 17 kB 00:00 2025-05-07T19:42:49.8798308Z (53/107): perl-FileHandle-2.03-477.amzn2023.0.6 3.1 MB/s | 16 kB 00:00 2025-05-07T19:42:49.8826448Z (54/107): perl-File-Temp-0.231.100-2.amzn2023.0 5.2 MB/s | 60 kB 00:00 2025-05-07T19:42:49.8853941Z (55/107): perl-Getopt-Long-2.52-2.amzn2023.0.2. 9.1 MB/s | 60 kB 00:00 2025-05-07T19:42:49.8863954Z (56/107): perl-Getopt-Std-1.12-477.amzn2023.0.6 2.5 MB/s | 16 kB 00:00 2025-05-07T19:42:49.8914318Z (57/107): perl-Git-2.47.1-1.amzn2023.0.2.noarch 5.3 MB/s | 42 kB 00:00 2025-05-07T19:42:49.8935952Z (58/107): perl-HTTP-Tiny-0.078-1.amzn2023.0.3.n 7.9 MB/s | 56 kB 00:00 2025-05-07T19:42:49.8958200Z (59/107): perl-IO-1.43-477.amzn2023.0.6.x86_64. 9.6 MB/s | 87 kB 00:00 2025-05-07T19:42:49.8983587Z (60/107): perl-IO-Socket-IP-0.41-3.amzn2023.0.2 6.9 MB/s | 42 kB 00:00 2025-05-07T19:42:49.9033922Z (61/107): perl-IO-Socket-SSL-2.075-1.amzn2023.0 31 MB/s | 218 kB 00:00 2025-05-07T19:42:49.9044820Z (62/107): perl-IPC-Open3-1.21-477.amzn2023.0.6. 2.8 MB/s | 23 kB 00:00 2025-05-07T19:42:49.9073587Z (63/107): perl-MIME-Base64-3.16-2.amzn2023.0.2. 3.6 MB/s | 31 kB 00:00 2025-05-07T19:42:49.9106789Z (64/107): perl-Mozilla-CA-20200520-4.amzn2023.0 2.6 MB/s | 13 kB 00:00 2025-05-07T19:42:49.9135862Z (65/107): perl-NDBM_File-1.15-477.amzn2023.0.6. 2.9 MB/s | 23 kB 00:00 2025-05-07T19:42:49.9175980Z (66/107): perl-Net-SSLeay-1.94-1.amzn2023.0.1.x 38 MB/s | 392 kB 00:00 2025-05-07T19:42:49.9200073Z (67/107): perl-POSIX-1.94-477.amzn2023.0.6.x86_ 12 MB/s | 97 kB 00:00 2025-05-07T19:42:49.9237163Z (68/107): perl-PathTools-3.78-459.amzn2023.0.2. 15 MB/s | 85 kB 00:00 2025-05-07T19:42:49.9251243Z (69/107): perl-Pod-Escapes-1.07-458.amzn2023.0. 2.9 MB/s | 20 kB 00:00 2025-05-07T19:42:49.9278253Z (70/107): perl-Pod-Perldoc-3.28.01-459.amzn2023 11 MB/s | 84 kB 00:00 2025-05-07T19:42:49.9331130Z (71/107): perl-Pod-Simple-3.42-2.amzn2023.0.2.n 30 MB/s | 215 kB 00:00 2025-05-07T19:42:49.9342115Z (72/107): perl-Pod-Usage-2.01-2.amzn2023.0.2.no 4.8 MB/s | 41 kB 00:00 2025-05-07T19:42:49.9363116Z (73/107): perl-Scalar-List-Utils-1.56-459.amzn2 8.4 MB/s | 71 kB 00:00 2025-05-07T19:42:49.9390545Z (74/107): perl-SelectSaver-1.02-477.amzn2023.0. 3.0 MB/s | 12 kB 00:00 2025-05-07T19:42:49.9424685Z (75/107): perl-Storable-3.21-458.amzn2023.0.2.x 17 MB/s | 96 kB 00:00 2025-05-07T19:42:49.9440283Z (76/107): perl-Socket-2.032-1.amzn2023.0.2.x86_ 5.9 MB/s | 55 kB 00:00 2025-05-07T19:42:49.9449184Z (77/107): perl-Symbol-1.08-477.amzn2023.0.6.noa 2.4 MB/s | 15 kB 00:00 2025-05-07T19:42:49.9473678Z (78/107): perl-Term-ANSIColor-5.01-459.amzn2023 11 MB/s | 48 kB 00:00 2025-05-07T19:42:49.9499445Z (79/107): perl-Term-Cap-1.17-458.amzn2023.0.2.n 5.2 MB/s | 22 kB 00:00 2025-05-07T19:42:49.9521967Z (80/107): perl-TermReadKey-2.38-9.amzn2023.0.2. 5.3 MB/s | 36 kB 00:00 2025-05-07T19:42:49.9535410Z (81/107): perl-Text-ParseWords-3.30-458.amzn202 2.6 MB/s | 17 kB 00:00 2025-05-07T19:42:49.9570669Z (82/107): perl-Text-Tabs+Wrap-2021.0726-1.amzn2 3.5 MB/s | 22 kB 00:00 2025-05-07T19:42:49.9593252Z (83/107): perl-Time-Local-1.300-5.amzn2023.0.2. 5.1 MB/s | 34 kB 00:00 2025-05-07T19:42:49.9611190Z (84/107): perl-URI-5.09-1.amzn2023.0.2.noarch.r 15 MB/s | 108 kB 00:00 2025-05-07T19:42:49.9632976Z (85/107): perl-base-2.27-477.amzn2023.0.6.noarc 3.3 MB/s | 17 kB 00:00 2025-05-07T19:42:49.9663130Z (86/107): perl-constant-1.33-459.amzn2023.0.2.n 5.0 MB/s | 23 kB 00:00 2025-05-07T19:42:49.9675107Z (87/107): perl-if-0.60.800-477.amzn2023.0.6.noa 2.3 MB/s | 14 kB 00:00 2025-05-07T19:42:49.9702564Z (88/107): perl-interpreter-5.32.1-477.amzn2023. 11 MB/s | 71 kB 00:00 2025-05-07T19:42:49.9718605Z (89/107): perl-lib-0.65-477.amzn2023.0.6.x86_64 2.8 MB/s | 15 kB 00:00 2025-05-07T19:42:49.9751166Z (90/107): perl-libnet-3.13-2.amzn2023.0.2.noarc 18 MB/s | 126 kB 00:00 2025-05-07T19:42:49.9949691Z (91/107): perl-libs-5.32.1-477.amzn2023.0.6.x86 85 MB/s | 2.0 MB 00:00 2025-05-07T19:42:49.9963704Z (92/107): perl-mro-1.23-477.amzn2023.0.6.x86_64 1.2 MB/s | 29 kB 00:00 2025-05-07T19:42:49.9982123Z (93/107): perl-overload-1.31-477.amzn2023.0.6.n 2.0 MB/s | 46 kB 00:00 2025-05-07T19:42:49.9998250Z (94/107): perl-overloading-0.02-477.amzn2023.0. 2.7 MB/s | 13 kB 00:00 2025-05-07T19:42:50.0035373Z (95/107): perl-parent-0.238-458.amzn2023.0.2.no 2.9 MB/s | 14 kB 00:00 2025-05-07T19:42:50.0064449Z (96/107): perl-podlators-4.14-458.amzn2023.0.2. 15 MB/s | 112 kB 00:00 2025-05-07T19:42:50.0074694Z (97/107): perl-subs-1.03-477.amzn2023.0.6.noarc 1.6 MB/s | 12 kB 00:00 2025-05-07T19:42:50.0100488Z (98/107): perl-vars-1.05-477.amzn2023.0.6.noarc 2.3 MB/s | 13 kB 00:00 2025-05-07T19:42:50.0213548Z (99/107): sudo-1.9.15-1.p5.amzn2023.0.1.x86_64. 100 MB/s | 1.3 MB 00:00 2025-05-07T19:42:50.0291910Z (100/107): shadow-utils-4.9-12.amzn2023.0.4.x86 54 MB/s | 1.1 MB 00:00 2025-05-07T19:42:50.0306591Z (101/107): sudo-python-plugin-1.9.15-1.p5.amzn2 2.7 MB/s | 56 kB 00:00 2025-05-07T19:42:50.0361339Z (102/107): systemd-libs-252.23-3.amzn2023.x86_6 46 MB/s | 613 kB 00:00 2025-05-07T19:42:50.0441136Z (103/107): tar-1.34-1.amzn2023.0.4.x86_64.rpm 70 MB/s | 879 kB 00:00 2025-05-07T19:42:50.0579969Z (104/107): util-linux-2.37.4-1.amzn2023.0.4.x86 84 MB/s | 2.2 MB 00:00 2025-05-07T19:42:50.0633161Z (105/107): util-linux-core-2.37.4-1.amzn2023.0. 17 MB/s | 432 kB 00:00 2025-05-07T19:42:50.0681200Z (106/107): wget-1.21.3-1.amzn2023.0.4.x86_64.rp 36 MB/s | 779 kB 00:00 2025-05-07T19:42:50.0700122Z (107/107): which-2.21-26.amzn2023.0.2.x86_64.rp 4.1 MB/s | 42 kB 00:00 2025-05-07T19:42:50.0717401Z -------------------------------------------------------------------------------- 2025-05-07T19:42:50.0718279Z Total 50 MB/s | 38 MB 00:00 2025-05-07T19:42:51.1328475Z Running transaction check 2025-05-07T19:42:51.1799831Z Transaction check succeeded. 2025-05-07T19:42:51.1800188Z Running transaction test 2025-05-07T19:42:51.5494600Z Transaction test succeeded. 2025-05-07T19:42:51.5495044Z Running transaction 2025-05-07T19:42:52.3842212Z Preparing : 1/1 2025-05-07T19:42:52.4017775Z Installing : systemd-libs-252.23-3.amzn2023.x86_64 1/107 2025-05-07T19:42:52.4272962Z Installing : nettle-3.10.1-1.amzn2023.0.1.x86_64 2/107 2025-05-07T19:42:52.4492669Z Installing : gnutls-3.8.3-6.amzn2023.0.1.x86_64 3/107 2025-05-07T19:42:52.4565900Z Installing : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:52.4640670Z Running scriptlet: util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 4/107 2025-05-07T19:42:52.4749864Z Installing : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:52.5043837Z Installing : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 6/107 2025-05-07T19:42:52.5134466Z Installing : nano-8.3-1.amzn2023.x86_64 7/107 2025-05-07T19:42:52.5201107Z Installing : nano-default-editor-8.3-1.amzn2023.noarch 8/107 2025-05-07T19:42:52.5710158Z Installing : libsemanage-3.4-5.amzn2023.0.2.x86_64 9/107 2025-05-07T19:42:52.5804165Z Installing : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 10/107 2025-05-07T19:42:52.6255601Z Running scriptlet: libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:52.6321923Z Installing : libutempter-1.2.1-4.amzn2023.0.2.x86_64 11/107 2025-05-07T19:42:52.6393507Z Installing : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 12/107 2025-05-07T19:42:52.6463743Z Installing : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 13/107 2025-05-07T19:42:52.6521519Z Installing : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 14/107 2025-05-07T19:42:52.6675053Z Installing : libeconf-0.4.0-1.amzn2023.0.3.x86_64 15/107 2025-05-07T19:42:52.6740555Z Installing : libdb-5.3.28-49.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:52.6811134Z Installing : libcbor-0.7.0-3.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:52.6893992Z Installing : libfido2-1.10.0-2.amzn2023.0.2.x86_64 18/107 2025-05-07T19:42:52.6961367Z Installing : less-608-2.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:52.7021751Z Installing : kmod-libs-29-2.amzn2023.0.5.x86_64 20/107 2025-05-07T19:42:52.7458645Z Installing : jansson-2.14-0.amzn2023.x86_64 21/107 2025-05-07T19:42:52.7548892Z Installing : hwdata-0.384-1.amzn2023.0.3.noarch 22/107 2025-05-07T19:42:52.7704522Z Installing : gzip-1.12-1.amzn2023.0.1.x86_64 23/107 2025-05-07T19:42:52.8156195Z Installing : cracklib-2.9.6-27.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:52.8344576Z Installing : pam-1.5.1-8.amzn2023.0.4.x86_64 25/107 2025-05-07T19:42:52.9174545Z Installing : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 26/107 2025-05-07T19:42:52.9176277Z Installing : util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:52.9177676Z warning: /etc/adjtime created as /etc/adjtime.rpmnew 2025-05-07T19:42:52.9178435Z 2025-05-07T19:42:52.9386962Z Running scriptlet: util-linux-2.37.4-1.amzn2023.0.4.x86_64 27/107 2025-05-07T19:42:52.9739386Z Running scriptlet: openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:52.9935216Z Installing : openssh-8.7p1-8.amzn2023.0.14.x86_64 28/107 2025-05-07T19:42:53.0016539Z Installing : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:53.1126266Z Running scriptlet: openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 29/107 2025-05-07T19:42:53.2630132Z Installing : git-core-2.47.1-1.amzn2023.0.2.x86_64 30/107 2025-05-07T19:42:53.2766148Z Installing : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 31/107 2025-05-07T19:42:53.3188126Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:53.3271606Z Installing : groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:53.3342416Z Running scriptlet: groff-base-1.22.4-7.amzn2023.0.2.x86_64 32/107 2025-05-07T19:42:53.3423660Z Installing : perl-Digest-1.20-1.amzn2023.0.2.noarch 33/107 2025-05-07T19:42:53.3513225Z Installing : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:53.3572143Z Installing : perl-B-1.80-477.amzn2023.0.6.x86_64 35/107 2025-05-07T19:42:53.3620692Z Installing : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:53.3675975Z Installing : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 37/107 2025-05-07T19:42:53.3766399Z Installing : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 38/107 2025-05-07T19:42:53.3833412Z Installing : perl-libnet-3.13-2.amzn2023.0.2.noarch 39/107 2025-05-07T19:42:53.3937469Z Installing : perl-base-2.27-477.amzn2023.0.6.noarch 40/107 2025-05-07T19:42:53.4150801Z Installing : perl-URI-5.09-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:53.4238844Z Installing : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 42/107 2025-05-07T19:42:53.4296146Z Installing : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 43/107 2025-05-07T19:42:53.4342660Z Installing : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 44/107 2025-05-07T19:42:53.4397429Z Installing : perl-if-0.60.800-477.amzn2023.0.6.noarch 45/107 2025-05-07T19:42:53.4455808Z Installing : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:53.4515859Z Installing : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:53.4604935Z Installing : perl-File-Path-2.18-2.amzn2023.0.2.noarch 48/107 2025-05-07T19:42:53.4669661Z Installing : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 49/107 2025-05-07T19:42:53.4715191Z Installing : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 50/107 2025-05-07T19:42:53.4778084Z Installing : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 51/107 2025-05-07T19:42:53.4842812Z Installing : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 52/107 2025-05-07T19:42:53.4898696Z Installing : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 53/107 2025-05-07T19:42:53.4943289Z Installing : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:53.5001031Z Installing : perl-subs-1.03-477.amzn2023.0.6.noarch 55/107 2025-05-07T19:42:53.5070117Z Installing : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 56/107 2025-05-07T19:42:53.5130341Z Installing : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 57/107 2025-05-07T19:42:53.5233428Z Installing : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 58/107 2025-05-07T19:42:53.5318183Z Installing : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 59/107 2025-05-07T19:42:53.5383289Z Installing : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 60/107 2025-05-07T19:42:53.5433871Z Installing : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 61/107 2025-05-07T19:42:53.5481669Z Installing : perl-Symbol-1.08-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:53.5563308Z Installing : perl-File-stat-1.09-477.amzn2023.0.6.noarch 63/107 2025-05-07T19:42:53.5663323Z Installing : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:53.5738264Z Installing : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 65/107 2025-05-07T19:42:53.5796406Z Installing : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 66/107 2025-05-07T19:42:53.5858435Z Installing : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 67/107 2025-05-07T19:42:53.5934781Z Installing : perl-mro-1.23-477.amzn2023.0.6.x86_64 68/107 2025-05-07T19:42:53.5995794Z Installing : perl-IO-1.43-477.amzn2023.0.6.x86_64 69/107 2025-05-07T19:42:53.6058368Z Installing : perl-overloading-0.02-477.amzn2023.0.6.noarch 70/107 2025-05-07T19:42:53.6124819Z Installing : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:42:53.6173873Z Installing : perl-Errno-1.30-477.amzn2023.0.6.x86_64 72/107 2025-05-07T19:42:53.6222480Z Installing : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 73/107 2025-05-07T19:42:53.6282510Z Installing : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:53.6356499Z Installing : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:53.6435811Z Installing : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 76/107 2025-05-07T19:42:53.6502112Z Installing : perl-constant-1.33-459.amzn2023.0.2.noarch 77/107 2025-05-07T19:42:53.6564113Z Installing : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 78/107 2025-05-07T19:42:53.6622844Z Installing : perl-overload-1.31-477.amzn2023.0.6.noarch 79/107 2025-05-07T19:42:53.6673904Z Installing : perl-parent-1:0.238-458.amzn2023.0.2.noarch 80/107 2025-05-07T19:42:53.6735362Z Installing : perl-vars-1.05-477.amzn2023.0.6.noarch 81/107 2025-05-07T19:42:53.6793383Z Installing : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 82/107 2025-05-07T19:42:53.6850244Z Installing : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 83/107 2025-05-07T19:42:53.6902418Z Installing : perl-Carp-1.50-458.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:53.6956369Z Installing : perl-Exporter-5.74-459.amzn2023.0.2.noarch 85/107 2025-05-07T19:42:53.7034229Z Installing : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 86/107 2025-05-07T19:42:53.7576195Z Installing : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 87/107 2025-05-07T19:42:53.8548600Z Installing : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 88/107 2025-05-07T19:42:53.8677484Z Installing : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:53.8757604Z Installing : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 90/107 2025-05-07T19:42:53.8827228Z Installing : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 91/107 2025-05-07T19:42:53.8895884Z Installing : perl-File-Find-1.37-477.amzn2023.0.6.noarch 92/107 2025-05-07T19:42:53.8964035Z Installing : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 93/107 2025-05-07T19:42:53.9019361Z Installing : perl-lib-0.65-477.amzn2023.0.6.x86_64 94/107 2025-05-07T19:42:53.9080614Z Installing : perl-Git-2.47.1-1.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:53.9163879Z Installing : git-2.47.1-1.amzn2023.0.2.x86_64 96/107 2025-05-07T19:42:53.9370426Z Installing : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 97/107 2025-05-07T19:42:53.9497292Z Installing : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 98/107 2025-05-07T19:42:53.9580703Z Installing : openldap-2.4.57-6.amzn2023.0.7.x86_64 99/107 2025-05-07T19:42:53.9980292Z Installing : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 100/107 2025-05-07T19:42:54.1202249Z Installing : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 101/107 2025-05-07T19:42:54.1297134Z Installing : binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:54.1417011Z Running scriptlet: binutils-2.41-50.amzn2023.0.3.x86_64 102/107 2025-05-07T19:42:54.1720050Z Installing : pciutils-3.7.0-3.amzn2023.0.2.x86_64 103/107 2025-05-07T19:42:54.1817957Z Installing : wget-1.21.3-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:54.2063510Z Installing : which-2.21-26.amzn2023.0.2.x86_64 105/107 2025-05-07T19:42:54.2276339Z Installing : tar-2:1.34-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:54.2364028Z Installing : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:54.2481926Z Running scriptlet: pam-1.5.1-8.amzn2023.0.4.x86_64 107/107 2025-05-07T19:42:55.0179707Z Running scriptlet: findutils-1:4.8.0-2.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:55.0182091Z Verifying : binutils-2.41-50.amzn2023.0.3.x86_64 1/107 2025-05-07T19:42:55.0182774Z Verifying : cracklib-2.9.6-27.amzn2023.0.2.x86_64 2/107 2025-05-07T19:42:55.0183442Z Verifying : cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 3/107 2025-05-07T19:42:55.0184171Z Verifying : elfutils-debuginfod-client-0.188-3.amzn2023.0.2. 4/107 2025-05-07T19:42:55.0184802Z Verifying : findutils-1:4.8.0-2.amzn2023.0.2.x86_64 5/107 2025-05-07T19:42:55.0185468Z Verifying : git-2.47.1-1.amzn2023.0.2.x86_64 6/107 2025-05-07T19:42:55.0186128Z Verifying : git-core-2.47.1-1.amzn2023.0.2.x86_64 7/107 2025-05-07T19:42:55.0186732Z Verifying : git-core-doc-2.47.1-1.amzn2023.0.2.noarch 8/107 2025-05-07T19:42:55.0187789Z Verifying : gnutls-3.8.3-6.amzn2023.0.1.x86_64 9/107 2025-05-07T19:42:55.0188406Z Verifying : groff-base-1.22.4-7.amzn2023.0.2.x86_64 10/107 2025-05-07T19:42:55.0189074Z Verifying : gzip-1.12-1.amzn2023.0.1.x86_64 11/107 2025-05-07T19:42:55.0189805Z Verifying : hwdata-0.384-1.amzn2023.0.3.noarch 12/107 2025-05-07T19:42:55.0190401Z Verifying : jansson-2.14-0.amzn2023.x86_64 13/107 2025-05-07T19:42:55.0191055Z Verifying : kmod-libs-29-2.amzn2023.0.5.x86_64 14/107 2025-05-07T19:42:55.0191658Z Verifying : less-608-2.amzn2023.0.2.x86_64 15/107 2025-05-07T19:42:55.0192332Z Verifying : libcbor-0.7.0-3.amzn2023.0.2.x86_64 16/107 2025-05-07T19:42:55.0193115Z Verifying : libdb-5.3.28-49.amzn2023.0.2.x86_64 17/107 2025-05-07T19:42:55.0193807Z Verifying : libeconf-0.4.0-1.amzn2023.0.3.x86_64 18/107 2025-05-07T19:42:55.0194491Z Verifying : libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 19/107 2025-05-07T19:42:55.0195095Z Verifying : libfdisk-2.37.4-1.amzn2023.0.4.x86_64 20/107 2025-05-07T19:42:55.0195792Z Verifying : libfido2-1.10.0-2.amzn2023.0.2.x86_64 21/107 2025-05-07T19:42:55.0196487Z Verifying : libmetalink-0.1.3-14.amzn2023.0.2.x86_64 22/107 2025-05-07T19:42:55.0197113Z Verifying : libpwquality-1.4.4-6.amzn2023.0.2.x86_64 23/107 2025-05-07T19:42:55.0197838Z Verifying : libsemanage-3.4-5.amzn2023.0.2.x86_64 24/107 2025-05-07T19:42:55.0198474Z Verifying : libutempter-1.2.1-4.amzn2023.0.2.x86_64 25/107 2025-05-07T19:42:55.0199142Z Verifying : nano-8.3-1.amzn2023.x86_64 26/107 2025-05-07T19:42:55.0199811Z Verifying : nano-default-editor-8.3-1.amzn2023.noarch 27/107 2025-05-07T19:42:55.0200461Z Verifying : ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 28/107 2025-05-07T19:42:55.0201194Z Verifying : nettle-3.10.1-1.amzn2023.0.1.x86_64 29/107 2025-05-07T19:42:55.0201806Z Verifying : openldap-2.4.57-6.amzn2023.0.7.x86_64 30/107 2025-05-07T19:42:55.0202493Z Verifying : openssh-8.7p1-8.amzn2023.0.14.x86_64 31/107 2025-05-07T19:42:55.0203162Z Verifying : openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 32/107 2025-05-07T19:42:55.0203785Z Verifying : pam-1.5.1-8.amzn2023.0.4.x86_64 33/107 2025-05-07T19:42:55.0204451Z Verifying : pciutils-3.7.0-3.amzn2023.0.2.x86_64 34/107 2025-05-07T19:42:55.0205058Z Verifying : pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 35/107 2025-05-07T19:42:55.0205750Z Verifying : perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 36/107 2025-05-07T19:42:55.0206628Z Verifying : perl-B-1.80-477.amzn2023.0.6.x86_64 37/107 2025-05-07T19:42:55.0207233Z Verifying : perl-Carp-1.50-458.amzn2023.0.2.noarch 38/107 2025-05-07T19:42:55.0207930Z Verifying : perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 39/107 2025-05-07T19:42:55.0208591Z Verifying : perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 40/107 2025-05-07T19:42:55.0209350Z Verifying : perl-Digest-1.20-1.amzn2023.0.2.noarch 41/107 2025-05-07T19:42:55.0210028Z Verifying : perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 42/107 2025-05-07T19:42:55.0210674Z Verifying : perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 43/107 2025-05-07T19:42:55.0211344Z Verifying : perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 44/107 2025-05-07T19:42:55.0211920Z Verifying : perl-Errno-1.30-477.amzn2023.0.6.x86_64 45/107 2025-05-07T19:42:55.0212715Z Verifying : perl-Error-1:0.17029-5.amzn2023.0.2.noarch 46/107 2025-05-07T19:42:55.0213330Z Verifying : perl-Exporter-5.74-459.amzn2023.0.2.noarch 47/107 2025-05-07T19:42:55.0213987Z Verifying : perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 48/107 2025-05-07T19:42:55.0214795Z Verifying : perl-File-Basename-2.85-477.amzn2023.0.6.noarch 49/107 2025-05-07T19:42:55.0215427Z Verifying : perl-File-Find-1.37-477.amzn2023.0.6.noarch 50/107 2025-05-07T19:42:55.0216056Z Verifying : perl-File-Path-2.18-2.amzn2023.0.2.noarch 51/107 2025-05-07T19:42:55.0216651Z Verifying : perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 52/107 2025-05-07T19:42:55.0217246Z Verifying : perl-File-stat-1.09-477.amzn2023.0.6.noarch 53/107 2025-05-07T19:42:55.0217821Z Verifying : perl-FileHandle-2.03-477.amzn2023.0.6.noarch 54/107 2025-05-07T19:42:55.0218455Z Verifying : perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 55/107 2025-05-07T19:42:55.0219019Z Verifying : perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 56/107 2025-05-07T19:42:55.0219588Z Verifying : perl-Git-2.47.1-1.amzn2023.0.2.noarch 57/107 2025-05-07T19:42:55.0220142Z Verifying : perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 58/107 2025-05-07T19:42:55.0220676Z Verifying : perl-IO-1.43-477.amzn2023.0.6.x86_64 59/107 2025-05-07T19:42:55.0221235Z Verifying : perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 60/107 2025-05-07T19:42:55.0221796Z Verifying : perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 61/107 2025-05-07T19:42:55.0222380Z Verifying : perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 62/107 2025-05-07T19:42:55.0222949Z Verifying : perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 63/107 2025-05-07T19:42:55.0223511Z Verifying : perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 64/107 2025-05-07T19:42:55.0224091Z Verifying : perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 65/107 2025-05-07T19:42:55.0224633Z Verifying : perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 66/107 2025-05-07T19:42:55.0225201Z Verifying : perl-POSIX-1.94-477.amzn2023.0.6.x86_64 67/107 2025-05-07T19:42:55.0225755Z Verifying : perl-PathTools-3.78-459.amzn2023.0.2.x86_64 68/107 2025-05-07T19:42:55.0226329Z Verifying : perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 69/107 2025-05-07T19:42:55.0226909Z Verifying : perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 70/107 2025-05-07T19:42:55.0227466Z Verifying : perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 71/107 2025-05-07T19:42:55.0228039Z Verifying : perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 72/107 2025-05-07T19:42:55.0228701Z Verifying : perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x 73/107 2025-05-07T19:42:55.0229476Z Verifying : perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 74/107 2025-05-07T19:42:55.0229987Z Verifying : perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 75/107 2025-05-07T19:42:55.0230495Z Verifying : perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 76/107 2025-05-07T19:42:55.0231007Z Verifying : perl-Symbol-1.08-477.amzn2023.0.6.noarch 77/107 2025-05-07T19:42:55.0231959Z Verifying : perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 78/107 2025-05-07T19:42:55.0232615Z Verifying : perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 79/107 2025-05-07T19:42:55.0233165Z Verifying : perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 80/107 2025-05-07T19:42:55.0233777Z Verifying : perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarc 81/107 2025-05-07T19:42:55.0234361Z Verifying : perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noa 82/107 2025-05-07T19:42:55.0234909Z Verifying : perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 83/107 2025-05-07T19:42:55.0235633Z Verifying : perl-URI-5.09-1.amzn2023.0.2.noarch 84/107 2025-05-07T19:42:55.0236164Z Verifying : perl-base-2.27-477.amzn2023.0.6.noarch 85/107 2025-05-07T19:42:55.0236724Z Verifying : perl-constant-1.33-459.amzn2023.0.2.noarch 86/107 2025-05-07T19:42:55.0237261Z Verifying : perl-if-0.60.800-477.amzn2023.0.6.noarch 87/107 2025-05-07T19:42:55.0237817Z Verifying : perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_6 88/107 2025-05-07T19:42:55.0238365Z Verifying : perl-lib-0.65-477.amzn2023.0.6.x86_64 89/107 2025-05-07T19:42:55.0238902Z Verifying : perl-libnet-3.13-2.amzn2023.0.2.noarch 90/107 2025-05-07T19:42:55.0239449Z Verifying : perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 91/107 2025-05-07T19:42:55.0239971Z Verifying : perl-mro-1.23-477.amzn2023.0.6.x86_64 92/107 2025-05-07T19:42:55.0240530Z Verifying : perl-overload-1.31-477.amzn2023.0.6.noarch 93/107 2025-05-07T19:42:55.0241088Z Verifying : perl-overloading-0.02-477.amzn2023.0.6.noarch 94/107 2025-05-07T19:42:55.0241652Z Verifying : perl-parent-1:0.238-458.amzn2023.0.2.noarch 95/107 2025-05-07T19:42:55.0242208Z Verifying : perl-podlators-1:4.14-458.amzn2023.0.2.noarch 96/107 2025-05-07T19:42:55.0242745Z Verifying : perl-subs-1.03-477.amzn2023.0.6.noarch 97/107 2025-05-07T19:42:55.0243290Z Verifying : perl-vars-1.05-477.amzn2023.0.6.noarch 98/107 2025-05-07T19:42:55.0243824Z Verifying : shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 99/107 2025-05-07T19:42:55.0244359Z Verifying : sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 100/107 2025-05-07T19:42:55.0244920Z Verifying : sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_ 101/107 2025-05-07T19:42:55.0245595Z Verifying : systemd-libs-252.23-3.amzn2023.x86_64 102/107 2025-05-07T19:42:55.0246115Z Verifying : tar-2:1.34-1.amzn2023.0.4.x86_64 103/107 2025-05-07T19:42:55.0246610Z Verifying : util-linux-2.37.4-1.amzn2023.0.4.x86_64 104/107 2025-05-07T19:42:55.0247152Z Verifying : util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 105/107 2025-05-07T19:42:55.0247663Z Verifying : wget-1.21.3-1.amzn2023.0.4.x86_64 106/107 2025-05-07T19:42:55.1215651Z Verifying : which-2.21-26.amzn2023.0.2.x86_64 107/107 2025-05-07T19:42:55.1216653Z 2025-05-07T19:42:55.1217012Z Installed: 2025-05-07T19:42:55.1217992Z binutils-2.41-50.amzn2023.0.3.x86_64 2025-05-07T19:42:55.1219553Z cracklib-2.9.6-27.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1221149Z cyrus-sasl-lib-2.1.27-18.amzn2023.0.3.x86_64 2025-05-07T19:42:55.1223062Z elfutils-debuginfod-client-0.188-3.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1223663Z findutils-1:4.8.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1224170Z git-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1224692Z git-core-2.47.1-1.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1225242Z git-core-doc-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:55.1225772Z gnutls-3.8.3-6.amzn2023.0.1.x86_64 2025-05-07T19:42:55.1226312Z groff-base-1.22.4-7.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1226826Z gzip-1.12-1.amzn2023.0.1.x86_64 2025-05-07T19:42:55.1227358Z hwdata-0.384-1.amzn2023.0.3.noarch 2025-05-07T19:42:55.1228005Z jansson-2.14-0.amzn2023.x86_64 2025-05-07T19:42:55.1228550Z kmod-libs-29-2.amzn2023.0.5.x86_64 2025-05-07T19:42:55.1229197Z less-608-2.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1229691Z libcbor-0.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1230208Z libdb-5.3.28-49.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1230710Z libeconf-0.4.0-1.amzn2023.0.3.x86_64 2025-05-07T19:42:55.1231633Z libedit-3.1-38.20210714cvs.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1232322Z libfdisk-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:55.1233048Z libfido2-1.10.0-2.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1233592Z libmetalink-0.1.3-14.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1234171Z libpwquality-1.4.4-6.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1234725Z libsemanage-3.4-5.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1235289Z libutempter-1.2.1-4.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1235823Z nano-8.3-1.amzn2023.x86_64 2025-05-07T19:42:55.1236357Z nano-default-editor-8.3-1.amzn2023.noarch 2025-05-07T19:42:55.1236945Z ncurses-6.2-4.20200222.amzn2023.0.6.x86_64 2025-05-07T19:42:55.1237477Z nettle-3.10.1-1.amzn2023.0.1.x86_64 2025-05-07T19:42:55.1238025Z openldap-2.4.57-6.amzn2023.0.7.x86_64 2025-05-07T19:42:55.1238565Z openssh-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:55.1239249Z openssh-clients-8.7p1-8.amzn2023.0.14.x86_64 2025-05-07T19:42:55.1239761Z pam-1.5.1-8.amzn2023.0.4.x86_64 2025-05-07T19:42:55.1240236Z pciutils-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1240777Z pciutils-libs-3.7.0-3.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1241324Z perl-AutoLoader-5.74-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1241872Z perl-B-1.80-477.amzn2023.0.6.x86_64 2025-05-07T19:42:55.1242405Z perl-Carp-1.50-458.amzn2023.0.2.noarch 2025-05-07T19:42:55.1243025Z perl-Class-Struct-0.66-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1243666Z perl-Data-Dumper-2.174-460.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1244384Z perl-Digest-1.20-1.amzn2023.0.2.noarch 2025-05-07T19:42:55.1244966Z perl-Digest-MD5-2.58-2.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1245513Z perl-DynaLoader-1.47-477.amzn2023.0.6.x86_64 2025-05-07T19:42:55.1246085Z perl-Encode-4:3.15-462.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1246673Z perl-Errno-1.30-477.amzn2023.0.6.x86_64 2025-05-07T19:42:55.1247420Z perl-Error-1:0.17029-5.amzn2023.0.2.noarch 2025-05-07T19:42:55.1248011Z perl-Exporter-5.74-459.amzn2023.0.2.noarch 2025-05-07T19:42:55.1248552Z perl-Fcntl-1.13-477.amzn2023.0.6.x86_64 2025-05-07T19:42:55.1249144Z perl-File-Basename-2.85-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1249841Z perl-File-Find-1.37-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1250389Z perl-File-Path-2.18-2.amzn2023.0.2.noarch 2025-05-07T19:42:55.1250942Z perl-File-Temp-1:0.231.100-2.amzn2023.0.2.noarch 2025-05-07T19:42:55.1251464Z perl-File-stat-1.09-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1252025Z perl-FileHandle-2.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1252561Z perl-Getopt-Long-1:2.52-2.amzn2023.0.2.noarch 2025-05-07T19:42:55.1253115Z perl-Getopt-Std-1.12-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1253661Z perl-Git-2.47.1-1.amzn2023.0.2.noarch 2025-05-07T19:42:55.1254183Z perl-HTTP-Tiny-0.078-1.amzn2023.0.3.noarch 2025-05-07T19:42:55.1254724Z perl-IO-1.43-477.amzn2023.0.6.x86_64 2025-05-07T19:42:55.1255243Z perl-IO-Socket-IP-0.41-3.amzn2023.0.2.noarch 2025-05-07T19:42:55.1255811Z perl-IO-Socket-SSL-2.075-1.amzn2023.0.2.noarch 2025-05-07T19:42:55.1256355Z perl-IPC-Open3-1.21-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1256912Z perl-MIME-Base64-3.16-2.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1257486Z perl-Mozilla-CA-20200520-4.amzn2023.0.2.noarch 2025-05-07T19:42:55.1258028Z perl-NDBM_File-1.15-477.amzn2023.0.6.x86_64 2025-05-07T19:42:55.1258579Z perl-Net-SSLeay-1.94-1.amzn2023.0.1.x86_64 2025-05-07T19:42:55.1259110Z perl-POSIX-1.94-477.amzn2023.0.6.x86_64 2025-05-07T19:42:55.1259669Z perl-PathTools-3.78-459.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1260236Z perl-Pod-Escapes-1:1.07-458.amzn2023.0.2.noarch 2025-05-07T19:42:55.1260782Z perl-Pod-Perldoc-3.28.01-459.amzn2023.0.3.noarch 2025-05-07T19:42:55.1261344Z perl-Pod-Simple-1:3.42-2.amzn2023.0.2.noarch 2025-05-07T19:42:55.1261868Z perl-Pod-Usage-4:2.01-2.amzn2023.0.2.noarch 2025-05-07T19:42:55.1262426Z perl-Scalar-List-Utils-4:1.56-459.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1262982Z perl-SelectSaver-1.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1263544Z perl-Socket-4:2.032-1.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1264087Z perl-Storable-1:3.21-458.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1264619Z perl-Symbol-1.08-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1265275Z perl-Term-ANSIColor-5.01-459.amzn2023.0.2.noarch 2025-05-07T19:42:55.1265842Z perl-Term-Cap-1.17-458.amzn2023.0.2.noarch 2025-05-07T19:42:55.1266410Z perl-TermReadKey-2.38-9.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1266980Z perl-Text-ParseWords-3.30-458.amzn2023.0.2.noarch 2025-05-07T19:42:55.1267538Z perl-Text-Tabs+Wrap-2021.0726-1.amzn2023.0.1.noarch 2025-05-07T19:42:55.1268089Z perl-Time-Local-2:1.300-5.amzn2023.0.2.noarch 2025-05-07T19:42:55.1268597Z perl-URI-5.09-1.amzn2023.0.2.noarch 2025-05-07T19:42:55.1269115Z perl-base-2.27-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1269639Z perl-constant-1.33-459.amzn2023.0.2.noarch 2025-05-07T19:42:55.1270174Z perl-if-0.60.800-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1270846Z perl-interpreter-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:55.1271357Z perl-lib-0.65-477.amzn2023.0.6.x86_64 2025-05-07T19:42:55.1271887Z perl-libnet-3.13-2.amzn2023.0.2.noarch 2025-05-07T19:42:55.1272395Z perl-libs-4:5.32.1-477.amzn2023.0.6.x86_64 2025-05-07T19:42:55.1273203Z perl-mro-1.23-477.amzn2023.0.6.x86_64 2025-05-07T19:42:55.1273812Z perl-overload-1.31-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1274423Z perl-overloading-0.02-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1275022Z perl-parent-1:0.238-458.amzn2023.0.2.noarch 2025-05-07T19:42:55.1275588Z perl-podlators-1:4.14-458.amzn2023.0.2.noarch 2025-05-07T19:42:55.1276172Z perl-subs-1.03-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1276725Z perl-vars-1.05-477.amzn2023.0.6.noarch 2025-05-07T19:42:55.1277290Z shadow-utils-2:4.9-12.amzn2023.0.4.x86_64 2025-05-07T19:42:55.1277835Z sudo-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:55.1278383Z sudo-python-plugin-1.9.15-1.p5.amzn2023.0.1.x86_64 2025-05-07T19:42:55.1278966Z systemd-libs-252.23-3.amzn2023.x86_64 2025-05-07T19:42:55.1279581Z tar-2:1.34-1.amzn2023.0.4.x86_64 2025-05-07T19:42:55.1280072Z util-linux-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:55.1280582Z util-linux-core-2.37.4-1.amzn2023.0.4.x86_64 2025-05-07T19:42:55.1281092Z wget-1.21.3-1.amzn2023.0.4.x86_64 2025-05-07T19:42:55.1281611Z which-2.21-26.amzn2023.0.2.x86_64 2025-05-07T19:42:55.1281905Z 2025-05-07T19:42:55.1281990Z Complete! 2025-05-07T19:42:55.2032318Z ##[group]Run actions/checkout@v4 2025-05-07T19:42:55.2032807Z with: 2025-05-07T19:42:55.2033082Z submodules: true 2025-05-07T19:42:55.2033353Z repository: pytorch/FBGEMM 2025-05-07T19:42:55.2033884Z token: *** 2025-05-07T19:42:55.2034119Z ssh-strict: true 2025-05-07T19:42:55.2034404Z ssh-user: git 2025-05-07T19:42:55.2034661Z persist-credentials: true 2025-05-07T19:42:55.2034978Z clean: true 2025-05-07T19:42:55.2035268Z sparse-checkout-cone-mode: true 2025-05-07T19:42:55.2035578Z fetch-depth: 1 2025-05-07T19:42:55.2035854Z fetch-tags: false 2025-05-07T19:42:55.2036119Z show-progress: true 2025-05-07T19:42:55.2036406Z lfs: false 2025-05-07T19:42:55.2036654Z set-safe-directory: true 2025-05-07T19:42:55.2037184Z env: 2025-05-07T19:42:55.2037436Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:55.2037807Z BUILD_ENV: build_binary 2025-05-07T19:42:55.2038097Z BUILD_TARGET: default 2025-05-07T19:42:55.2038398Z BUILD_VARIANT: cuda 2025-05-07T19:42:55.2038786Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:42:55.2039074Z ##[endgroup] 2025-05-07T19:42:55.2085203Z ##[command]/usr/bin/docker exec 9d6bffcb154a3aafcf2591fe5a342c22735765e9fd7602d6273e9fda2f8bbbaa sh -c "cat /etc/*release | grep ^ID" 2025-05-07T19:42:55.5111774Z Syncing repository: pytorch/FBGEMM 2025-05-07T19:42:55.5113680Z ##[group]Getting Git version info 2025-05-07T19:42:55.5114057Z Working directory is '/__w/FBGEMM/FBGEMM' 2025-05-07T19:42:55.5114633Z [command]/usr/bin/git version 2025-05-07T19:42:55.5114911Z git version 2.47.1 2025-05-07T19:42:55.5115878Z ##[endgroup] 2025-05-07T19:42:55.5120270Z Temporarily overriding HOME='/__w/_temp/cca1f529-b1ed-445d-8424-f064d87de5f2' before making global git config changes 2025-05-07T19:42:55.5121126Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T19:42:55.5122275Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T19:42:55.5152132Z [command]/usr/bin/git config --local --get remote.origin.url 2025-05-07T19:42:55.5172424Z https://github.com/pytorch/FBGEMM 2025-05-07T19:42:55.5184292Z ##[group]Removing previously created refs, to avoid conflicts 2025-05-07T19:42:55.5186527Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-05-07T19:42:55.5214471Z HEAD 2025-05-07T19:42:55.5252476Z ##[endgroup] 2025-05-07T19:42:55.5252763Z [command]/usr/bin/git submodule status 2025-05-07T19:42:55.5588862Z e5d7c0bd5d9aec44d68830187138149e6a8c4e32 external/asmjit (e5d7c0b) 2025-05-07T19:42:55.5643299Z 4a61bdd4bd4ed730e078aebc7c0fcf046ff29406 external/composable_kernel (4a61bdd) 2025-05-07T19:42:55.5706292Z 6543fec09b2f04ac4a666882998b534afc9c1349 external/cpuinfo (6543fec) 2025-05-07T19:42:55.5775489Z 3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3 external/cutlass (3ed8d2e) 2025-05-07T19:42:55.5845602Z f8d7d77c06936315286eb55f8de22cd23c188571 external/googletest (f8d7d77) 2025-05-07T19:42:55.5915532Z 420084499c7c1e1c2d801922f40df202eac5f3a0 external/hipify_torch (4200844) 2025-05-07T19:42:55.5971331Z 9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03 external/json (9cca280) 2025-05-07T19:42:55.5975714Z ##[group]Cleaning the repository 2025-05-07T19:42:55.5979167Z [command]/usr/bin/git clean -ffdx 2025-05-07T19:42:55.7507795Z Removing build_only/ 2025-05-07T19:42:55.7508245Z Removing collect_env.py 2025-05-07T19:42:55.7508556Z Removing fbgemm_gpu/_skbuild/ 2025-05-07T19:42:55.7508924Z Removing fbgemm_gpu/codegen/genscript/__pycache__/ 2025-05-07T19:42:55.7509318Z Removing fbgemm_gpu/dist/ 2025-05-07T19:42:55.7509687Z Removing fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:42:55.7510110Z Removing fbgemm_gpu/fbgemm_gpu_genai_nightly.egg-info/ 2025-05-07T19:42:55.7510521Z Removing fbgemm_gpu/fbgemm_gpu_nightly.egg-info/ 2025-05-07T19:42:55.7515214Z [command]/usr/bin/git reset --hard HEAD 2025-05-07T19:42:55.8553344Z HEAD is now at 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:55.8555598Z ##[endgroup] 2025-05-07T19:42:55.8557576Z ##[group]Disabling automatic garbage collection 2025-05-07T19:42:55.8562122Z [command]/usr/bin/git config --local gc.auto 0 2025-05-07T19:42:55.8589068Z ##[endgroup] 2025-05-07T19:42:55.8594292Z ##[group]Setting up auth 2025-05-07T19:42:55.8594755Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T19:42:55.8616844Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T19:42:55.8882343Z Entering 'external/asmjit' 2025-05-07T19:42:55.8947628Z Entering 'external/composable_kernel' 2025-05-07T19:42:55.9006773Z Entering 'external/cpuinfo' 2025-05-07T19:42:55.9057707Z Entering 'external/cutlass' 2025-05-07T19:42:55.9119278Z Entering 'external/googletest' 2025-05-07T19:42:55.9173694Z Entering 'external/hipify_torch' 2025-05-07T19:42:55.9219520Z Entering 'external/json' 2025-05-07T19:42:55.9277538Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T19:42:55.9310103Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T19:42:55.9565242Z Entering 'external/asmjit' 2025-05-07T19:42:55.9613542Z Entering 'external/composable_kernel' 2025-05-07T19:42:55.9667586Z Entering 'external/cpuinfo' 2025-05-07T19:42:55.9724220Z Entering 'external/cutlass' 2025-05-07T19:42:55.9785196Z Entering 'external/googletest' 2025-05-07T19:42:55.9848710Z Entering 'external/hipify_torch' 2025-05-07T19:42:55.9905740Z Entering 'external/json' 2025-05-07T19:42:55.9987513Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:42:56.0032616Z ##[endgroup] 2025-05-07T19:42:56.0033120Z ##[group]Fetching the repository 2025-05-07T19:42:56.0040317Z [command]/usr/bin/git -c protocol.version=2 fetch --no-tags --prune --no-recurse-submodules --depth=1 origin +a2f4c52051596e74bc8c16e3d2867a4ecdd271e0:refs/remotes/pull/4066/merge 2025-05-07T19:42:56.1664735Z From https://github.com/pytorch/FBGEMM 2025-05-07T19:42:56.1666503Z + 1c9ad64...a2f4c52 a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 -> pull/4066/merge (forced update) 2025-05-07T19:42:56.1698052Z ##[endgroup] 2025-05-07T19:42:56.1699194Z ##[group]Determining the checkout info 2025-05-07T19:42:56.1732454Z ##[endgroup] 2025-05-07T19:42:56.1732806Z [command]/usr/bin/git sparse-checkout disable 2025-05-07T19:42:56.2195108Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-05-07T19:42:56.2219304Z ##[group]Checking out the ref 2025-05-07T19:42:56.2219842Z [command]/usr/bin/git checkout --progress --force refs/remotes/pull/4066/merge 2025-05-07T19:42:56.2289284Z Warning: you are leaving 1 commit behind, not connected to 2025-05-07T19:42:56.2289921Z any of your branches: 2025-05-07T19:42:56.2290146Z 2025-05-07T19:42:56.2290524Z 1c9ad64 Merge f6528e7b1e8f5602e7dba30cd73b48ae6630981c into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:56.2291054Z 2025-05-07T19:42:56.2291302Z If you want to keep it by creating a new branch, this may be a good time 2025-05-07T19:42:56.2291719Z to do so with: 2025-05-07T19:42:56.2291854Z 2025-05-07T19:42:56.2292007Z git branch 1c9ad64 2025-05-07T19:42:56.2292218Z 2025-05-07T19:42:56.2292643Z HEAD is now at a2f4c52 Merge 6060cd4b5f971680caecdcc657faccb5720d1c3e into fd4df5f456e0cca514bacd98a39efb72990fd9f4 2025-05-07T19:42:56.2294218Z ##[endgroup] 2025-05-07T19:42:56.2294662Z ##[group]Setting up auth for fetching submodules 2025-05-07T19:42:56.2299092Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-05-07T19:42:56.2331111Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-05-07T19:42:56.2352657Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-05-07T19:42:56.2375022Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-05-07T19:42:56.2396293Z ##[endgroup] 2025-05-07T19:42:56.2396702Z ##[group]Fetching submodules 2025-05-07T19:42:56.2398993Z [command]/usr/bin/git submodule sync 2025-05-07T19:42:56.2690042Z Synchronizing submodule url for 'external/asmjit' 2025-05-07T19:42:56.2691478Z Synchronizing submodule url for 'external/composable_kernel' 2025-05-07T19:42:56.2692793Z Synchronizing submodule url for 'external/cpuinfo' 2025-05-07T19:42:56.2693209Z Synchronizing submodule url for 'external/cutlass' 2025-05-07T19:42:56.2693642Z Synchronizing submodule url for 'external/googletest' 2025-05-07T19:42:56.2694327Z Synchronizing submodule url for 'external/hipify_torch' 2025-05-07T19:42:56.2694757Z Synchronizing submodule url for 'external/json' 2025-05-07T19:42:56.2696343Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --depth=1 2025-05-07T19:42:56.3446950Z Submodule path 'external/asmjit': checked out 'e5d7c0bd5d9aec44d68830187138149e6a8c4e32' 2025-05-07T19:42:56.6066101Z Submodule path 'external/composable_kernel': checked out '4a61bdd4bd4ed730e078aebc7c0fcf046ff29406' 2025-05-07T19:42:56.6991497Z Submodule path 'external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-05-07T19:42:57.3564165Z Submodule path 'external/cutlass': checked out '3ed8d2ec4ba35ef5d9d8353826209b6f868f63d3' 2025-05-07T19:42:57.3942181Z Submodule path 'external/googletest': checked out 'f8d7d77c06936315286eb55f8de22cd23c188571' 2025-05-07T19:42:57.4017165Z Submodule path 'external/hipify_torch': checked out '420084499c7c1e1c2d801922f40df202eac5f3a0' 2025-05-07T19:42:57.5071781Z Submodule path 'external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-05-07T19:42:57.5084471Z [command]/usr/bin/git submodule foreach git config --local gc.auto 0 2025-05-07T19:42:57.5358131Z Entering 'external/asmjit' 2025-05-07T19:42:57.5393940Z Entering 'external/composable_kernel' 2025-05-07T19:42:57.5425803Z Entering 'external/cpuinfo' 2025-05-07T19:42:57.5462662Z Entering 'external/cutlass' 2025-05-07T19:42:57.5492255Z Entering 'external/googletest' 2025-05-07T19:42:57.5521291Z Entering 'external/hipify_torch' 2025-05-07T19:42:57.5550618Z Entering 'external/json' 2025-05-07T19:42:57.5583096Z ##[endgroup] 2025-05-07T19:42:57.5583536Z ##[group]Persisting credentials for submodules 2025-05-07T19:42:57.5585979Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-05-07T19:42:57.5850984Z Entering 'external/asmjit' 2025-05-07T19:42:57.5878819Z url.https://github.com/.insteadof 2025-05-07T19:42:57.5879248Z url.https://github.com/.insteadof 2025-05-07T19:42:57.5910864Z Entering 'external/composable_kernel' 2025-05-07T19:42:57.5955160Z url.https://github.com/.insteadof 2025-05-07T19:42:57.5956181Z url.https://github.com/.insteadof 2025-05-07T19:42:57.5994491Z Entering 'external/cpuinfo' 2025-05-07T19:42:57.6028332Z url.https://github.com/.insteadof 2025-05-07T19:42:57.6028820Z url.https://github.com/.insteadof 2025-05-07T19:42:57.6058084Z Entering 'external/cutlass' 2025-05-07T19:42:57.6100447Z url.https://github.com/.insteadof 2025-05-07T19:42:57.6101064Z url.https://github.com/.insteadof 2025-05-07T19:42:57.6140369Z Entering 'external/googletest' 2025-05-07T19:42:57.6183516Z url.https://github.com/.insteadof 2025-05-07T19:42:57.6184519Z url.https://github.com/.insteadof 2025-05-07T19:42:57.6221361Z Entering 'external/hipify_torch' 2025-05-07T19:42:57.6254838Z url.https://github.com/.insteadof 2025-05-07T19:42:57.6255735Z url.https://github.com/.insteadof 2025-05-07T19:42:57.6293762Z Entering 'external/json' 2025-05-07T19:42:57.6333000Z url.https://github.com/.insteadof 2025-05-07T19:42:57.6333380Z url.https://github.com/.insteadof 2025-05-07T19:42:57.6382608Z [command]/usr/bin/git submodule foreach sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-05-07T19:42:57.6685059Z Entering 'external/asmjit' 2025-05-07T19:42:57.6735554Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/asmjit/config remote.origin.url 2025-05-07T19:42:57.6740176Z Entering 'external/composable_kernel' 2025-05-07T19:42:57.6789199Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/composable_kernel/config remote.origin.url 2025-05-07T19:42:57.6791526Z Entering 'external/cpuinfo' 2025-05-07T19:42:57.6834861Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cpuinfo/config remote.origin.url 2025-05-07T19:42:57.6841805Z Entering 'external/cutlass' 2025-05-07T19:42:57.6886375Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/cutlass/config remote.origin.url 2025-05-07T19:42:57.6889719Z Entering 'external/googletest' 2025-05-07T19:42:57.6941557Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/googletest/config remote.origin.url 2025-05-07T19:42:57.6943089Z Entering 'external/hipify_torch' 2025-05-07T19:42:57.6990705Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/hipify_torch/config remote.origin.url 2025-05-07T19:42:57.6991305Z Entering 'external/json' 2025-05-07T19:42:57.7042433Z file:/__w/FBGEMM/FBGEMM/.git/modules/external/json/config remote.origin.url 2025-05-07T19:42:57.7138421Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-05-07T19:42:57.7412137Z Entering 'external/asmjit' 2025-05-07T19:42:57.7434026Z Entering 'external/composable_kernel' 2025-05-07T19:42:57.7471458Z Entering 'external/cpuinfo' 2025-05-07T19:42:57.7497455Z Entering 'external/cutlass' 2025-05-07T19:42:57.7532472Z Entering 'external/googletest' 2025-05-07T19:42:57.7555144Z Entering 'external/hipify_torch' 2025-05-07T19:42:57.7589893Z Entering 'external/json' 2025-05-07T19:42:57.7632360Z [command]/usr/bin/git submodule foreach git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-05-07T19:42:57.7896895Z Entering 'external/asmjit' 2025-05-07T19:42:57.7916194Z Entering 'external/composable_kernel' 2025-05-07T19:42:57.7940118Z Entering 'external/cpuinfo' 2025-05-07T19:42:57.7974777Z Entering 'external/cutlass' 2025-05-07T19:42:57.8002754Z Entering 'external/googletest' 2025-05-07T19:42:57.8033504Z Entering 'external/hipify_torch' 2025-05-07T19:42:57.8063341Z Entering 'external/json' 2025-05-07T19:42:57.8111084Z ##[endgroup] 2025-05-07T19:42:57.8135744Z [command]/usr/bin/git log -1 --format=%H 2025-05-07T19:42:57.8155548Z a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:42:57.8321813Z ##[group]Run . $PRELUDE; print_system_info 2025-05-07T19:42:57.8322277Z . $PRELUDE; print_system_info 2025-05-07T19:42:57.8322824Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:57.8323188Z env: 2025-05-07T19:42:57.8323454Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:57.8323764Z BUILD_ENV: build_binary 2025-05-07T19:42:57.8324053Z BUILD_TARGET: default 2025-05-07T19:42:57.8324294Z BUILD_VARIANT: cuda 2025-05-07T19:42:57.8324572Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:42:57.8324833Z ##[endgroup] 2025-05-07T19:42:58.2853430Z ################################################################################ 2025-05-07T19:42:58.2854512Z # Print System Info 2025-05-07T19:42:58.2855158Z # 2025-05-07T19:42:58.2868672Z # [2025-05-07T19:42:58.286Z] + print_system_info 2025-05-07T19:42:58.2869802Z ################################################################################ 2025-05-07T19:42:58.2870502Z 2025-05-07T19:42:58.2870960Z ################################################################################ 2025-05-07T19:42:58.2872000Z [INFO] Printing environment variables ... 2025-05-07T19:42:58.2872720Z + printenv 2025-05-07T19:42:58.2872845Z 2025-05-07T19:42:58.2887648Z GITHUB_WORKSPACE=/__w/FBGEMM/FBGEMM 2025-05-07T19:42:58.2888030Z BUILD_VARIANT=cuda 2025-05-07T19:42:58.2888308Z HOSTNAME=9d6bffcb154a 2025-05-07T19:42:58.2888749Z GITHUB_PATH=/__w/_temp/_runner_file_commands/add_path_a5124a18-31d1-457e-ad1c-0353dbad4543 2025-05-07T19:42:58.2889265Z GITHUB_ACTION=__run_2 2025-05-07T19:42:58.2889542Z GITHUB_RUN_NUMBER=10601 2025-05-07T19:42:58.2889819Z RUNNER_NAME=i-05f9a7b1831803d00 2025-05-07T19:42:58.2890175Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-05-07T19:42:58.2890558Z PLATFORM_NAME_LC=linux-x86_64 2025-05-07T19:42:58.2890857Z MACHINE_NAME_LC=x86_64 2025-05-07T19:42:58.2891103Z GITHUB_TRIGGERING_ACTOR=q10 2025-05-07T19:42:58.2891403Z PRELUDE=.github/scripts/setup_env.bash 2025-05-07T19:42:58.2891712Z GITHUB_REF_TYPE=branch 2025-05-07T19:42:58.2892229Z *** 2025-05-07T19:42:58.2892434Z GITHUB_REPOSITORY_ID=150154628 2025-05-07T19:42:58.2892984Z GITHUB_ACTIONS=true 2025-05-07T19:42:58.2893262Z GITHUB_SHA=a2f4c52051596e74bc8c16e3d2867a4ecdd271e0 2025-05-07T19:42:58.2893858Z GITHUB_WORKFLOW_REF=pytorch/FBGEMM/.github/workflows/fbgemm_gpu_ci_cuda.yml@refs/pull/4066/merge 2025-05-07T19:42:58.2894541Z RUNNER_ENVIRONMENT=self-hosted 2025-05-07T19:42:58.2894821Z GITHUB_REF=refs/pull/4066/merge 2025-05-07T19:42:58.2895101Z RUNNER_OS=Linux 2025-05-07T19:42:58.2895324Z GITHUB_REF_PROTECTED=false 2025-05-07T19:42:58.2895596Z HOME=/github/home 2025-05-07T19:42:58.2895852Z GITHUB_API_URL=https://api.github.com 2025-05-07T19:42:58.2896170Z RUNNER_ARCH=X64 2025-05-07T19:42:58.2896390Z RUNNER_TEMP=/__w/_temp 2025-05-07T19:42:58.2896754Z BUILD_TARGET=default 2025-05-07T19:42:58.2897175Z GITHUB_STATE=/__w/_temp/_runner_file_commands/save_state_a5124a18-31d1-457e-ad1c-0353dbad4543 2025-05-07T19:42:58.2897793Z GITHUB_ENV=/__w/_temp/_runner_file_commands/set_env_a5124a18-31d1-457e-ad1c-0353dbad4543 2025-05-07T19:42:58.2898278Z GITHUB_EVENT_PATH=/github/workflow/event.json 2025-05-07T19:42:58.2898596Z GITHUB_EVENT_NAME=pull_request 2025-05-07T19:42:58.2898871Z GITHUB_RUN_ID=14891846252 2025-05-07T19:42:58.2899313Z GITHUB_STEP_SUMMARY=/__w/_temp/_runner_file_commands/step_summary_a5124a18-31d1-457e-ad1c-0353dbad4543 2025-05-07T19:42:58.2899821Z BUILD_ENV=build_binary 2025-05-07T19:42:58.2900047Z GITHUB_ACTOR=q10 2025-05-07T19:42:58.2900272Z GITHUB_RUN_ATTEMPT=1 2025-05-07T19:42:58.2900508Z KERN_NAME_LC=linux 2025-05-07T19:42:58.2900726Z BUILD_CUDA_VERSION=11.8.0 2025-05-07T19:42:58.2901036Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-05-07T19:42:58.2901368Z PLATFORM_NAME=Linux-x86_64 2025-05-07T19:42:58.2901648Z GITHUB_SERVER_URL=https://github.com 2025-05-07T19:42:58.2901919Z SHLVL=1 2025-05-07T19:42:58.2902122Z GITHUB_ACTOR_ID=255046 2025-05-07T19:42:58.2902354Z RUNNER_TOOL_CACHE=/__w/_tool 2025-05-07T19:42:58.2902868Z GITHUB_WORKFLOW_SHA=6060cd4b5f971680caecdcc657faccb5720d1c3e 2025-05-07T19:42:58.2903229Z GITHUB_REF_NAME=4066/merge 2025-05-07T19:42:58.2903489Z KERN_NAME=Linux 2025-05-07T19:42:58.2903728Z GITHUB_JOB=build_artifact 2025-05-07T19:42:58.2903986Z GITHUB_REPOSITORY=pytorch/FBGEMM 2025-05-07T19:42:58.2904274Z GITHUB_RETENTION_DAYS=90 2025-05-07T19:42:58.2904518Z RUNNER_WORKSPACE=/__w/FBGEMM 2025-05-07T19:42:58.2904829Z GITHUB_ACTION_REPOSITORY= 2025-05-07T19:42:58.2905163Z PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-05-07T19:42:58.2905550Z GITHUB_BASE_REF=main 2025-05-07T19:42:58.2905764Z CI=true 2025-05-07T19:42:58.2905988Z GITHUB_REPOSITORY_OWNER=pytorch 2025-05-07T19:42:58.2906264Z GITHUB_HEAD_REF=bm/genai-rocm-oss-6 2025-05-07T19:42:58.2906731Z GITHUB_ACTION_REF= 2025-05-07T19:42:58.2906977Z GITHUB_WORKFLOW=FBGEMM GPU/GenAI CUDA CI 2025-05-07T19:42:58.2907488Z GITHUB_OUTPUT=/__w/_temp/_runner_file_commands/set_output_a5124a18-31d1-457e-ad1c-0353dbad4543 2025-05-07T19:42:58.2907981Z MACHINE_NAME=x86_64 2025-05-07T19:42:58.2908207Z _=/usr/bin/printenv 2025-05-07T19:42:58.2908344Z 2025-05-07T19:42:58.2908484Z ################################################################################ 2025-05-07T19:42:58.2908810Z [INFO] Print ldd version ... 2025-05-07T19:42:58.2909088Z + ldd --version 2025-05-07T19:42:58.2909220Z 2025-05-07T19:42:58.2909704Z ldd (GNU libc) 2.34 2025-05-07T19:42:58.2910336Z Copyright (C) 2021 Free Software Foundation, Inc. 2025-05-07T19:42:58.2910839Z This is free software; see the source for copying conditions. There is NO 2025-05-07T19:42:58.2911425Z warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 2025-05-07T19:42:58.2911922Z Written by Roland McGrath and Ulrich Drepper. 2025-05-07T19:42:58.2912157Z 2025-05-07T19:42:58.2912281Z ################################################################################ 2025-05-07T19:42:58.2912784Z [INFO] Print CPU info ... 2025-05-07T19:42:58.2913057Z + nproc 2025-05-07T19:42:58.2913180Z 2025-05-07T19:42:58.2925104Z 96 2025-05-07T19:42:58.2925443Z 2025-05-07T19:42:58.2926419Z + lscpu 2025-05-07T19:42:58.2926974Z 2025-05-07T19:42:58.3182734Z Architecture: x86_64 2025-05-07T19:42:58.3183349Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:42:58.3183822Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3184300Z Byte Order: Little Endian 2025-05-07T19:42:58.3184686Z CPU(s): 96 2025-05-07T19:42:58.3185018Z On-line CPU(s) list: 0-95 2025-05-07T19:42:58.3185405Z Vendor ID: GenuineIntel 2025-05-07T19:42:58.3185838Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3186288Z CPU family: 6 2025-05-07T19:42:58.3186658Z Model: 85 2025-05-07T19:42:58.3186986Z Thread(s) per core: 2 2025-05-07T19:42:58.3187371Z Core(s) per socket: 24 2025-05-07T19:42:58.3187693Z Socket(s): 2 2025-05-07T19:42:58.3188054Z Stepping: 7 2025-05-07T19:42:58.3188388Z BogoMIPS: 5999.98 2025-05-07T19:42:58.3190889Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3193507Z Hypervisor vendor: KVM 2025-05-07T19:42:58.3194173Z Virtualization type: full 2025-05-07T19:42:58.3194555Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:42:58.3195009Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:42:58.3195418Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:42:58.3195858Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:42:58.3196232Z NUMA node(s): 2 2025-05-07T19:42:58.3196602Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:42:58.3197009Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:42:58.3197511Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:42:58.3198141Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:42:58.3198690Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:42:58.3199383Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:42:58.3200011Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:42:58.3200713Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:42:58.3201397Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:42:58.3201797Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:42:58.3202224Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:42:58.3202625Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:42:58.3203271Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:42:58.3204185Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:42:58.3204863Z Vulnerability Srbds: Not affected 2025-05-07T19:42:58.3205306Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:42:58.3205572Z 2025-05-07T19:42:58.3205678Z + cat /proc/cpuinfo 2025-05-07T19:42:58.3205967Z 2025-05-07T19:42:58.3206308Z processor : 0 2025-05-07T19:42:58.3206550Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3206856Z cpu family : 6 2025-05-07T19:42:58.3207109Z model : 85 2025-05-07T19:42:58.3207436Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3207830Z stepping : 7 2025-05-07T19:42:58.3208101Z microcode : 0x5003901 2025-05-07T19:42:58.3208395Z cpu MHz : 3304.888 2025-05-07T19:42:58.3208646Z cache size : 36608 KB 2025-05-07T19:42:58.3208941Z physical id : 0 2025-05-07T19:42:58.3209180Z siblings : 48 2025-05-07T19:42:58.3209446Z core id : 0 2025-05-07T19:42:58.3209686Z cpu cores : 24 2025-05-07T19:42:58.3209960Z apicid : 0 2025-05-07T19:42:58.3210188Z initial apicid : 0 2025-05-07T19:42:58.3210474Z fpu : yes 2025-05-07T19:42:58.3210713Z fpu_exception : yes 2025-05-07T19:42:58.3211004Z cpuid level : 13 2025-05-07T19:42:58.3211246Z wp : yes 2025-05-07T19:42:58.3213646Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3216411Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3217066Z bogomips : 5999.98 2025-05-07T19:42:58.3217313Z clflush size : 64 2025-05-07T19:42:58.3217584Z cache_alignment : 64 2025-05-07T19:42:58.3217884Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3218333Z power management: 2025-05-07T19:42:58.3218486Z 2025-05-07T19:42:58.3218587Z processor : 1 2025-05-07T19:42:58.3218860Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3219138Z cpu family : 6 2025-05-07T19:42:58.3219408Z model : 85 2025-05-07T19:42:58.3219717Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3220141Z stepping : 7 2025-05-07T19:42:58.3220415Z microcode : 0x5003901 2025-05-07T19:42:58.3220675Z cpu MHz : 3211.053 2025-05-07T19:42:58.3220956Z cache size : 36608 KB 2025-05-07T19:42:58.3221212Z physical id : 0 2025-05-07T19:42:58.3221480Z siblings : 48 2025-05-07T19:42:58.3221711Z core id : 1 2025-05-07T19:42:58.3221970Z cpu cores : 24 2025-05-07T19:42:58.3222209Z apicid : 2 2025-05-07T19:42:58.3222475Z initial apicid : 2 2025-05-07T19:42:58.3222715Z fpu : yes 2025-05-07T19:42:58.3222969Z fpu_exception : yes 2025-05-07T19:42:58.3223216Z cpuid level : 13 2025-05-07T19:42:58.3223481Z wp : yes 2025-05-07T19:42:58.3225854Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3228634Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3229245Z bogomips : 5999.98 2025-05-07T19:42:58.3229517Z clflush size : 64 2025-05-07T19:42:58.3229758Z cache_alignment : 64 2025-05-07T19:42:58.3230099Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3230456Z power management: 2025-05-07T19:42:58.3230639Z 2025-05-07T19:42:58.3230739Z processor : 2 2025-05-07T19:42:58.3230986Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3231565Z cpu family : 6 2025-05-07T19:42:58.3231796Z model : 85 2025-05-07T19:42:58.3232135Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3232634Z stepping : 7 2025-05-07T19:42:58.3232871Z microcode : 0x5003901 2025-05-07T19:42:58.3233154Z cpu MHz : 3788.686 2025-05-07T19:42:58.3233394Z cache size : 36608 KB 2025-05-07T19:42:58.3233679Z physical id : 0 2025-05-07T19:42:58.3233909Z siblings : 48 2025-05-07T19:42:58.3234158Z core id : 2 2025-05-07T19:42:58.3234377Z cpu cores : 24 2025-05-07T19:42:58.3234628Z apicid : 4 2025-05-07T19:42:58.3234848Z initial apicid : 4 2025-05-07T19:42:58.3235111Z fpu : yes 2025-05-07T19:42:58.3235336Z fpu_exception : yes 2025-05-07T19:42:58.3235604Z cpuid level : 13 2025-05-07T19:42:58.3235863Z wp : yes 2025-05-07T19:42:58.3238209Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3240970Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3241615Z bogomips : 5999.98 2025-05-07T19:42:58.3241864Z clflush size : 64 2025-05-07T19:42:58.3242137Z cache_alignment : 64 2025-05-07T19:42:58.3242447Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3242834Z power management: 2025-05-07T19:42:58.3242981Z 2025-05-07T19:42:58.3243080Z processor : 3 2025-05-07T19:42:58.3243501Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3243778Z cpu family : 6 2025-05-07T19:42:58.3244043Z model : 85 2025-05-07T19:42:58.3244380Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3244767Z stepping : 7 2025-05-07T19:42:58.3245033Z microcode : 0x5003901 2025-05-07T19:42:58.3245288Z cpu MHz : 3246.021 2025-05-07T19:42:58.3245556Z cache size : 36608 KB 2025-05-07T19:42:58.3245815Z physical id : 0 2025-05-07T19:42:58.3246092Z siblings : 48 2025-05-07T19:42:58.3246319Z core id : 3 2025-05-07T19:42:58.3246571Z cpu cores : 24 2025-05-07T19:42:58.3246798Z apicid : 6 2025-05-07T19:42:58.3247056Z initial apicid : 6 2025-05-07T19:42:58.3247297Z fpu : yes 2025-05-07T19:42:58.3247563Z fpu_exception : yes 2025-05-07T19:42:58.3247806Z cpuid level : 13 2025-05-07T19:42:58.3248073Z wp : yes 2025-05-07T19:42:58.3250460Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3253230Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3253851Z bogomips : 5999.98 2025-05-07T19:42:58.3254140Z clflush size : 64 2025-05-07T19:42:58.3254394Z cache_alignment : 64 2025-05-07T19:42:58.3254736Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3255098Z power management: 2025-05-07T19:42:58.3255285Z 2025-05-07T19:42:58.3255386Z processor : 4 2025-05-07T19:42:58.3255638Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3255946Z cpu family : 6 2025-05-07T19:42:58.3256186Z model : 85 2025-05-07T19:42:58.3256598Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3257017Z stepping : 7 2025-05-07T19:42:58.3257258Z microcode : 0x5003901 2025-05-07T19:42:58.3257544Z cpu MHz : 3421.519 2025-05-07T19:42:58.3257784Z cache size : 36608 KB 2025-05-07T19:42:58.3258063Z physical id : 0 2025-05-07T19:42:58.3258294Z siblings : 48 2025-05-07T19:42:58.3258554Z core id : 4 2025-05-07T19:42:58.3258769Z cpu cores : 24 2025-05-07T19:42:58.3259021Z apicid : 8 2025-05-07T19:42:58.3259240Z initial apicid : 8 2025-05-07T19:42:58.3259504Z fpu : yes 2025-05-07T19:42:58.3259725Z fpu_exception : yes 2025-05-07T19:42:58.3259999Z cpuid level : 13 2025-05-07T19:42:58.3260264Z wp : yes 2025-05-07T19:42:58.3262599Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3265329Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3265981Z bogomips : 5999.98 2025-05-07T19:42:58.3266233Z clflush size : 64 2025-05-07T19:42:58.3266511Z cache_alignment : 64 2025-05-07T19:42:58.3266812Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3267195Z power management: 2025-05-07T19:42:58.3267342Z 2025-05-07T19:42:58.3267440Z processor : 5 2025-05-07T19:42:58.3267713Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3267975Z cpu family : 6 2025-05-07T19:42:58.3268310Z model : 85 2025-05-07T19:42:58.3268708Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3269103Z stepping : 7 2025-05-07T19:42:58.3269372Z microcode : 0x5003901 2025-05-07T19:42:58.3269628Z cpu MHz : 3199.401 2025-05-07T19:42:58.3269911Z cache size : 36608 KB 2025-05-07T19:42:58.3270166Z physical id : 0 2025-05-07T19:42:58.3270438Z siblings : 48 2025-05-07T19:42:58.3270661Z core id : 5 2025-05-07T19:42:58.3270910Z cpu cores : 24 2025-05-07T19:42:58.3271143Z apicid : 10 2025-05-07T19:42:58.3271395Z initial apicid : 10 2025-05-07T19:42:58.3271632Z fpu : yes 2025-05-07T19:42:58.3271880Z fpu_exception : yes 2025-05-07T19:42:58.3272145Z cpuid level : 13 2025-05-07T19:42:58.3272376Z wp : yes 2025-05-07T19:42:58.3274823Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3277567Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3278187Z bogomips : 5999.98 2025-05-07T19:42:58.3278462Z clflush size : 64 2025-05-07T19:42:58.3278714Z cache_alignment : 64 2025-05-07T19:42:58.3279047Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3279402Z power management: 2025-05-07T19:42:58.3279572Z 2025-05-07T19:42:58.3279668Z processor : 6 2025-05-07T19:42:58.3279905Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3280196Z cpu family : 6 2025-05-07T19:42:58.3280429Z model : 85 2025-05-07T19:42:58.3280759Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3281160Z stepping : 7 2025-05-07T19:42:58.3281459Z microcode : 0x5003901 2025-05-07T19:42:58.3281737Z cpu MHz : 3247.413 2025-05-07T19:42:58.3281978Z cache size : 36608 KB 2025-05-07T19:42:58.3282254Z physical id : 0 2025-05-07T19:42:58.3282485Z siblings : 48 2025-05-07T19:42:58.3282744Z core id : 6 2025-05-07T19:42:58.3282964Z cpu cores : 24 2025-05-07T19:42:58.3283226Z apicid : 12 2025-05-07T19:42:58.3283452Z initial apicid : 12 2025-05-07T19:42:58.3283716Z fpu : yes 2025-05-07T19:42:58.3283945Z fpu_exception : yes 2025-05-07T19:42:58.3284217Z cpuid level : 13 2025-05-07T19:42:58.3284480Z wp : yes 2025-05-07T19:42:58.3286891Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3289581Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3290219Z bogomips : 5999.98 2025-05-07T19:42:58.3290454Z clflush size : 64 2025-05-07T19:42:58.3290716Z cache_alignment : 64 2025-05-07T19:42:58.3291012Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3291386Z power management: 2025-05-07T19:42:58.3291529Z 2025-05-07T19:42:58.3291627Z processor : 7 2025-05-07T19:42:58.3291885Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3292122Z cpu family : 6 2025-05-07T19:42:58.3292334Z model : 85 2025-05-07T19:42:58.3292633Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3293055Z stepping : 7 2025-05-07T19:42:58.3293315Z microcode : 0x5003901 2025-05-07T19:42:58.3293562Z cpu MHz : 3209.691 2025-05-07T19:42:58.3293835Z cache size : 36608 KB 2025-05-07T19:42:58.3294092Z physical id : 0 2025-05-07T19:42:58.3294362Z siblings : 48 2025-05-07T19:42:58.3294588Z core id : 7 2025-05-07T19:42:58.3294849Z cpu cores : 24 2025-05-07T19:42:58.3295077Z apicid : 14 2025-05-07T19:42:58.3295342Z initial apicid : 14 2025-05-07T19:42:58.3295588Z fpu : yes 2025-05-07T19:42:58.3295860Z fpu_exception : yes 2025-05-07T19:42:58.3296138Z cpuid level : 13 2025-05-07T19:42:58.3296373Z wp : yes 2025-05-07T19:42:58.3298714Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3301391Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3301992Z bogomips : 5999.98 2025-05-07T19:42:58.3302265Z clflush size : 64 2025-05-07T19:42:58.3302515Z cache_alignment : 64 2025-05-07T19:42:58.3302851Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3303206Z power management: 2025-05-07T19:42:58.3303382Z 2025-05-07T19:42:58.3303475Z processor : 8 2025-05-07T19:42:58.3303708Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3303994Z cpu family : 6 2025-05-07T19:42:58.3304245Z model : 85 2025-05-07T19:42:58.3304537Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3304934Z stepping : 7 2025-05-07T19:42:58.3305166Z microcode : 0x5003901 2025-05-07T19:42:58.3305444Z cpu MHz : 3171.522 2025-05-07T19:42:58.3305736Z cache size : 36608 KB 2025-05-07T19:42:58.3306003Z physical id : 0 2025-05-07T19:42:58.3306237Z siblings : 48 2025-05-07T19:42:58.3306481Z core id : 8 2025-05-07T19:42:58.3306701Z cpu cores : 24 2025-05-07T19:42:58.3306954Z apicid : 16 2025-05-07T19:42:58.3307183Z initial apicid : 16 2025-05-07T19:42:58.3307460Z fpu : yes 2025-05-07T19:42:58.3307688Z fpu_exception : yes 2025-05-07T19:42:58.3307966Z cpuid level : 13 2025-05-07T19:42:58.3308219Z wp : yes 2025-05-07T19:42:58.3310507Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3313442Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3314091Z bogomips : 5999.98 2025-05-07T19:42:58.3314340Z clflush size : 64 2025-05-07T19:42:58.3314611Z cache_alignment : 64 2025-05-07T19:42:58.3314924Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3315316Z power management: 2025-05-07T19:42:58.3315467Z 2025-05-07T19:42:58.3315568Z processor : 9 2025-05-07T19:42:58.3315841Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3316108Z cpu family : 6 2025-05-07T19:42:58.3316372Z model : 85 2025-05-07T19:42:58.3316696Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3317076Z stepping : 7 2025-05-07T19:42:58.3317332Z microcode : 0x5003901 2025-05-07T19:42:58.3321921Z cpu MHz : 3200.899 2025-05-07T19:42:58.3322285Z cache size : 36608 KB 2025-05-07T19:42:58.3322564Z physical id : 0 2025-05-07T19:42:58.3322842Z siblings : 48 2025-05-07T19:42:58.3323067Z core id : 9 2025-05-07T19:42:58.3323323Z cpu cores : 24 2025-05-07T19:42:58.3323553Z apicid : 18 2025-05-07T19:42:58.3323808Z initial apicid : 18 2025-05-07T19:42:58.3324051Z fpu : yes 2025-05-07T19:42:58.3324301Z fpu_exception : yes 2025-05-07T19:42:58.3324569Z cpuid level : 13 2025-05-07T19:42:58.3324805Z wp : yes 2025-05-07T19:42:58.3327196Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3329981Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3330602Z bogomips : 5999.98 2025-05-07T19:42:58.3330879Z clflush size : 64 2025-05-07T19:42:58.3331126Z cache_alignment : 64 2025-05-07T19:42:58.3331598Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3331949Z power management: 2025-05-07T19:42:58.3332127Z 2025-05-07T19:42:58.3332225Z processor : 10 2025-05-07T19:42:58.3332617Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3332906Z cpu family : 6 2025-05-07T19:42:58.3333156Z model : 85 2025-05-07T19:42:58.3333460Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3333862Z stepping : 7 2025-05-07T19:42:58.3334102Z microcode : 0x5003901 2025-05-07T19:42:58.3334384Z cpu MHz : 3237.195 2025-05-07T19:42:58.3334635Z cache size : 36608 KB 2025-05-07T19:42:58.3334909Z physical id : 0 2025-05-07T19:42:58.3335251Z siblings : 48 2025-05-07T19:42:58.3335501Z core id : 10 2025-05-07T19:42:58.3335733Z cpu cores : 24 2025-05-07T19:42:58.3335993Z apicid : 20 2025-05-07T19:42:58.3336223Z initial apicid : 20 2025-05-07T19:42:58.3336500Z fpu : yes 2025-05-07T19:42:58.3336720Z fpu_exception : yes 2025-05-07T19:42:58.3336962Z cpuid level : 13 2025-05-07T19:42:58.3337185Z wp : yes 2025-05-07T19:42:58.3339524Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3342243Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3342864Z bogomips : 5999.98 2025-05-07T19:42:58.3343099Z clflush size : 64 2025-05-07T19:42:58.3343330Z cache_alignment : 64 2025-05-07T19:42:58.3343603Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3344014Z power management: 2025-05-07T19:42:58.3344148Z 2025-05-07T19:42:58.3344236Z processor : 11 2025-05-07T19:42:58.3344475Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3345106Z cpu family : 6 2025-05-07T19:42:58.3345348Z model : 85 2025-05-07T19:42:58.3345636Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3346155Z stepping : 7 2025-05-07T19:42:58.3346419Z microcode : 0x5003901 2025-05-07T19:42:58.3346648Z cpu MHz : 3209.706 2025-05-07T19:42:58.3346901Z cache size : 36608 KB 2025-05-07T19:42:58.3347263Z physical id : 0 2025-05-07T19:42:58.3347531Z siblings : 48 2025-05-07T19:42:58.3347769Z core id : 11 2025-05-07T19:42:58.3348036Z cpu cores : 24 2025-05-07T19:42:58.3348352Z apicid : 22 2025-05-07T19:42:58.3348616Z initial apicid : 22 2025-05-07T19:42:58.3348863Z fpu : yes 2025-05-07T19:42:58.3349120Z fpu_exception : yes 2025-05-07T19:42:58.3349401Z cpuid level : 13 2025-05-07T19:42:58.3349650Z wp : yes 2025-05-07T19:42:58.3352028Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3354867Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3355491Z bogomips : 5999.98 2025-05-07T19:42:58.3355718Z clflush size : 64 2025-05-07T19:42:58.3355935Z cache_alignment : 64 2025-05-07T19:42:58.3356220Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3356546Z power management: 2025-05-07T19:42:58.3356694Z 2025-05-07T19:42:58.3356780Z processor : 12 2025-05-07T19:42:58.3356995Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3357246Z cpu family : 6 2025-05-07T19:42:58.3357462Z model : 85 2025-05-07T19:42:58.3357736Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3358101Z stepping : 7 2025-05-07T19:42:58.3358306Z microcode : 0x5003901 2025-05-07T19:42:58.3358547Z cpu MHz : 2999.994 2025-05-07T19:42:58.3358763Z cache size : 36608 KB 2025-05-07T19:42:58.3358999Z physical id : 0 2025-05-07T19:42:58.3359212Z siblings : 48 2025-05-07T19:42:58.3359459Z core id : 12 2025-05-07T19:42:58.3359752Z cpu cores : 24 2025-05-07T19:42:58.3359999Z apicid : 24 2025-05-07T19:42:58.3360228Z initial apicid : 24 2025-05-07T19:42:58.3360494Z fpu : yes 2025-05-07T19:42:58.3360716Z fpu_exception : yes 2025-05-07T19:42:58.3360990Z cpuid level : 13 2025-05-07T19:42:58.3361253Z wp : yes 2025-05-07T19:42:58.3363600Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3366359Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3367004Z bogomips : 5999.98 2025-05-07T19:42:58.3367249Z clflush size : 64 2025-05-07T19:42:58.3367519Z cache_alignment : 64 2025-05-07T19:42:58.3367816Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3368189Z power management: 2025-05-07T19:42:58.3368334Z 2025-05-07T19:42:58.3368430Z processor : 13 2025-05-07T19:42:58.3368698Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3368963Z cpu family : 6 2025-05-07T19:42:58.3369212Z model : 85 2025-05-07T19:42:58.3369544Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3369917Z stepping : 7 2025-05-07T19:42:58.3370176Z microcode : 0x5003901 2025-05-07T19:42:58.3370423Z cpu MHz : 2999.994 2025-05-07T19:42:58.3370685Z cache size : 36608 KB 2025-05-07T19:42:58.3370936Z physical id : 0 2025-05-07T19:42:58.3371200Z siblings : 48 2025-05-07T19:42:58.3371481Z core id : 13 2025-05-07T19:42:58.3371731Z cpu cores : 24 2025-05-07T19:42:58.3371956Z apicid : 26 2025-05-07T19:42:58.3372215Z initial apicid : 26 2025-05-07T19:42:58.3372460Z fpu : yes 2025-05-07T19:42:58.3372800Z fpu_exception : yes 2025-05-07T19:42:58.3373075Z cpuid level : 13 2025-05-07T19:42:58.3373316Z wp : yes 2025-05-07T19:42:58.3375670Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3378400Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3379016Z bogomips : 5999.98 2025-05-07T19:42:58.3379280Z clflush size : 64 2025-05-07T19:42:58.3379516Z cache_alignment : 64 2025-05-07T19:42:58.3379837Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3380183Z power management: 2025-05-07T19:42:58.3380349Z 2025-05-07T19:42:58.3380445Z processor : 14 2025-05-07T19:42:58.3380685Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3380974Z cpu family : 6 2025-05-07T19:42:58.3381231Z model : 85 2025-05-07T19:42:58.3381527Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3381933Z stepping : 7 2025-05-07T19:42:58.3382164Z microcode : 0x5003901 2025-05-07T19:42:58.3382442Z cpu MHz : 2999.994 2025-05-07T19:42:58.3382690Z cache size : 36608 KB 2025-05-07T19:42:58.3382978Z physical id : 0 2025-05-07T19:42:58.3383212Z siblings : 48 2025-05-07T19:42:58.3383464Z core id : 14 2025-05-07T19:42:58.3383689Z cpu cores : 24 2025-05-07T19:42:58.3383944Z apicid : 28 2025-05-07T19:42:58.3384176Z initial apicid : 28 2025-05-07T19:42:58.3384501Z fpu : yes 2025-05-07T19:42:58.3384727Z fpu_exception : yes 2025-05-07T19:42:58.3384995Z cpuid level : 13 2025-05-07T19:42:58.3385263Z wp : yes 2025-05-07T19:42:58.3387632Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3390414Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3391055Z bogomips : 5999.98 2025-05-07T19:42:58.3391297Z clflush size : 64 2025-05-07T19:42:58.3391578Z cache_alignment : 64 2025-05-07T19:42:58.3391876Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3392265Z power management: 2025-05-07T19:42:58.3392414Z 2025-05-07T19:42:58.3392585Z processor : 15 2025-05-07T19:42:58.3392867Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3393234Z cpu family : 6 2025-05-07T19:42:58.3393503Z model : 85 2025-05-07T19:42:58.3393842Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3394294Z stepping : 7 2025-05-07T19:42:58.3394562Z microcode : 0x5003901 2025-05-07T19:42:58.3394813Z cpu MHz : 3214.086 2025-05-07T19:42:58.3395096Z cache size : 36608 KB 2025-05-07T19:42:58.3395356Z physical id : 0 2025-05-07T19:42:58.3395633Z siblings : 48 2025-05-07T19:42:58.3395861Z core id : 15 2025-05-07T19:42:58.3396125Z cpu cores : 24 2025-05-07T19:42:58.3396368Z apicid : 30 2025-05-07T19:42:58.3396710Z initial apicid : 30 2025-05-07T19:42:58.3396955Z fpu : yes 2025-05-07T19:42:58.3397218Z fpu_exception : yes 2025-05-07T19:42:58.3397468Z cpuid level : 13 2025-05-07T19:42:58.3397673Z wp : yes 2025-05-07T19:42:58.3400026Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3402732Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3403328Z bogomips : 5999.98 2025-05-07T19:42:58.3403558Z clflush size : 64 2025-05-07T19:42:58.3403777Z cache_alignment : 64 2025-05-07T19:42:58.3404060Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3404387Z power management: 2025-05-07T19:42:58.3404533Z 2025-05-07T19:42:58.3404617Z processor : 16 2025-05-07T19:42:58.3404835Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3405087Z cpu family : 6 2025-05-07T19:42:58.3405304Z model : 85 2025-05-07T19:42:58.3405576Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3405944Z stepping : 7 2025-05-07T19:42:58.3406151Z microcode : 0x5003901 2025-05-07T19:42:58.3406421Z cpu MHz : 3129.901 2025-05-07T19:42:58.3406663Z cache size : 36608 KB 2025-05-07T19:42:58.3406941Z physical id : 0 2025-05-07T19:42:58.3407176Z siblings : 48 2025-05-07T19:42:58.3407424Z core id : 16 2025-05-07T19:42:58.3407650Z cpu cores : 24 2025-05-07T19:42:58.3407903Z apicid : 32 2025-05-07T19:42:58.3408131Z initial apicid : 32 2025-05-07T19:42:58.3408402Z fpu : yes 2025-05-07T19:42:58.3408649Z fpu_exception : yes 2025-05-07T19:42:58.3408960Z cpuid level : 13 2025-05-07T19:42:58.3409215Z wp : yes 2025-05-07T19:42:58.3411545Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3414301Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3414962Z bogomips : 5999.98 2025-05-07T19:42:58.3415211Z clflush size : 64 2025-05-07T19:42:58.3415479Z cache_alignment : 64 2025-05-07T19:42:58.3415780Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3416156Z power management: 2025-05-07T19:42:58.3416424Z 2025-05-07T19:42:58.3416569Z processor : 17 2025-05-07T19:42:58.3416858Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3417151Z cpu family : 6 2025-05-07T19:42:58.3417374Z model : 85 2025-05-07T19:42:58.3417702Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3418079Z stepping : 7 2025-05-07T19:42:58.3418335Z microcode : 0x5003901 2025-05-07T19:42:58.3418583Z cpu MHz : 3258.819 2025-05-07T19:42:58.3418855Z cache size : 36608 KB 2025-05-07T19:42:58.3419107Z physical id : 0 2025-05-07T19:42:58.3419366Z siblings : 48 2025-05-07T19:42:58.3419589Z core id : 17 2025-05-07T19:42:58.3419837Z cpu cores : 24 2025-05-07T19:42:58.3420066Z apicid : 34 2025-05-07T19:42:58.3420323Z initial apicid : 34 2025-05-07T19:42:58.3420556Z fpu : yes 2025-05-07T19:42:58.3420867Z fpu_exception : yes 2025-05-07T19:42:58.3421139Z cpuid level : 13 2025-05-07T19:42:58.3421375Z wp : yes 2025-05-07T19:42:58.3423740Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3426459Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3427071Z bogomips : 5999.98 2025-05-07T19:42:58.3427332Z clflush size : 64 2025-05-07T19:42:58.3427576Z cache_alignment : 64 2025-05-07T19:42:58.3427898Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3428261Z power management: 2025-05-07T19:42:58.3428441Z 2025-05-07T19:42:58.3428540Z processor : 18 2025-05-07T19:42:58.3428781Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3429076Z cpu family : 6 2025-05-07T19:42:58.3429329Z model : 85 2025-05-07T19:42:58.3429632Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3430032Z stepping : 7 2025-05-07T19:42:58.3430266Z microcode : 0x5003901 2025-05-07T19:42:58.3430545Z cpu MHz : 3224.988 2025-05-07T19:42:58.3430783Z cache size : 36608 KB 2025-05-07T19:42:58.3431059Z physical id : 0 2025-05-07T19:42:58.3431440Z siblings : 48 2025-05-07T19:42:58.3431699Z core id : 18 2025-05-07T19:42:58.3432038Z cpu cores : 24 2025-05-07T19:42:58.3432312Z apicid : 36 2025-05-07T19:42:58.3432675Z initial apicid : 36 2025-05-07T19:42:58.3432942Z fpu : yes 2025-05-07T19:42:58.3433208Z fpu_exception : yes 2025-05-07T19:42:58.3433465Z cpuid level : 13 2025-05-07T19:42:58.3433742Z wp : yes 2025-05-07T19:42:58.3436101Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3438977Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3439636Z bogomips : 5999.98 2025-05-07T19:42:58.3439886Z clflush size : 64 2025-05-07T19:42:58.3440176Z cache_alignment : 64 2025-05-07T19:42:58.3440496Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3440884Z power management: 2025-05-07T19:42:58.3441040Z 2025-05-07T19:42:58.3441142Z processor : 19 2025-05-07T19:42:58.3441424Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3466601Z cpu family : 6 2025-05-07T19:42:58.3466888Z model : 85 2025-05-07T19:42:58.3467206Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3467613Z stepping : 7 2025-05-07T19:42:58.3467887Z microcode : 0x5003901 2025-05-07T19:42:58.3468155Z cpu MHz : 3234.335 2025-05-07T19:42:58.3468397Z cache size : 36608 KB 2025-05-07T19:42:58.3468673Z physical id : 0 2025-05-07T19:42:58.3468905Z siblings : 48 2025-05-07T19:42:58.3469151Z core id : 19 2025-05-07T19:42:58.3469371Z cpu cores : 24 2025-05-07T19:42:58.3469617Z apicid : 38 2025-05-07T19:42:58.3469838Z initial apicid : 38 2025-05-07T19:42:58.3470097Z fpu : yes 2025-05-07T19:42:58.3470337Z fpu_exception : yes 2025-05-07T19:42:58.3470573Z cpuid level : 13 2025-05-07T19:42:58.3470816Z wp : yes 2025-05-07T19:42:58.3473571Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3476403Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3477050Z bogomips : 5999.98 2025-05-07T19:42:58.3477274Z clflush size : 64 2025-05-07T19:42:58.3477540Z cache_alignment : 64 2025-05-07T19:42:58.3477835Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3478226Z power management: 2025-05-07T19:42:58.3478378Z 2025-05-07T19:42:58.3478480Z processor : 20 2025-05-07T19:42:58.3478746Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3479032Z cpu family : 6 2025-05-07T19:42:58.3479256Z model : 85 2025-05-07T19:42:58.3479576Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3479955Z stepping : 7 2025-05-07T19:42:58.3480209Z microcode : 0x5003901 2025-05-07T19:42:58.3480456Z cpu MHz : 2999.994 2025-05-07T19:42:58.3480720Z cache size : 36608 KB 2025-05-07T19:42:58.3480965Z physical id : 0 2025-05-07T19:42:58.3481221Z siblings : 48 2025-05-07T19:42:58.3481437Z core id : 20 2025-05-07T19:42:58.3481679Z cpu cores : 24 2025-05-07T19:42:58.3481910Z apicid : 40 2025-05-07T19:42:58.3482154Z initial apicid : 40 2025-05-07T19:42:58.3482410Z fpu : yes 2025-05-07T19:42:58.3482624Z fpu_exception : yes 2025-05-07T19:42:58.3482895Z cpuid level : 13 2025-05-07T19:42:58.3483125Z wp : yes 2025-05-07T19:42:58.3485582Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3488205Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3488788Z bogomips : 5999.98 2025-05-07T19:42:58.3489064Z clflush size : 64 2025-05-07T19:42:58.3489272Z cache_alignment : 64 2025-05-07T19:42:58.3489572Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3489873Z power management: 2025-05-07T19:42:58.3490038Z 2025-05-07T19:42:58.3490120Z processor : 21 2025-05-07T19:42:58.3490367Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3490612Z cpu family : 6 2025-05-07T19:42:58.3490852Z model : 85 2025-05-07T19:42:58.3491130Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3491502Z stepping : 7 2025-05-07T19:42:58.3491714Z microcode : 0x5003901 2025-05-07T19:42:58.3491961Z cpu MHz : 2999.994 2025-05-07T19:42:58.3492173Z cache size : 36608 KB 2025-05-07T19:42:58.3492411Z physical id : 0 2025-05-07T19:42:58.3492611Z siblings : 48 2025-05-07T19:42:58.3492815Z core id : 21 2025-05-07T19:42:58.3493010Z cpu cores : 24 2025-05-07T19:42:58.3493226Z apicid : 42 2025-05-07T19:42:58.3493426Z initial apicid : 42 2025-05-07T19:42:58.3493655Z fpu : yes 2025-05-07T19:42:58.3493873Z fpu_exception : yes 2025-05-07T19:42:58.3494090Z cpuid level : 13 2025-05-07T19:42:58.3494314Z wp : yes 2025-05-07T19:42:58.3496524Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3499031Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3499602Z bogomips : 5999.98 2025-05-07T19:42:58.3499811Z clflush size : 64 2025-05-07T19:42:58.3500035Z cache_alignment : 64 2025-05-07T19:42:58.3500294Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3500619Z power management: 2025-05-07T19:42:58.3500744Z 2025-05-07T19:42:58.3500819Z processor : 22 2025-05-07T19:42:58.3501039Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3501284Z cpu family : 6 2025-05-07T19:42:58.3501472Z model : 85 2025-05-07T19:42:58.3501748Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3502086Z stepping : 7 2025-05-07T19:42:58.3502305Z microcode : 0x5003901 2025-05-07T19:42:58.3502511Z cpu MHz : 2999.994 2025-05-07T19:42:58.3502741Z cache size : 36608 KB 2025-05-07T19:42:58.3502964Z physical id : 0 2025-05-07T19:42:58.3503187Z siblings : 48 2025-05-07T19:42:58.3503383Z core id : 22 2025-05-07T19:42:58.3503594Z cpu cores : 24 2025-05-07T19:42:58.3503793Z apicid : 44 2025-05-07T19:42:58.3504005Z initial apicid : 44 2025-05-07T19:42:58.3504230Z fpu : yes 2025-05-07T19:42:58.3504423Z fpu_exception : yes 2025-05-07T19:42:58.3504651Z cpuid level : 13 2025-05-07T19:42:58.3504856Z wp : yes 2025-05-07T19:42:58.3507027Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3509599Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3510153Z bogomips : 5999.98 2025-05-07T19:42:58.3510389Z clflush size : 64 2025-05-07T19:42:58.3510608Z cache_alignment : 64 2025-05-07T19:42:58.3510903Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3511233Z power management: 2025-05-07T19:42:58.3511390Z 2025-05-07T19:42:58.3511477Z processor : 23 2025-05-07T19:42:58.3511714Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3511944Z cpu family : 6 2025-05-07T19:42:58.3512162Z model : 85 2025-05-07T19:42:58.3512495Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3513036Z stepping : 7 2025-05-07T19:42:58.3513255Z microcode : 0x5003901 2025-05-07T19:42:58.3513516Z cpu MHz : 3221.117 2025-05-07T19:42:58.3513813Z cache size : 36608 KB 2025-05-07T19:42:58.3514077Z physical id : 0 2025-05-07T19:42:58.3514295Z siblings : 48 2025-05-07T19:42:58.3514525Z core id : 23 2025-05-07T19:42:58.3514739Z cpu cores : 24 2025-05-07T19:42:58.3514969Z apicid : 46 2025-05-07T19:42:58.3515204Z initial apicid : 46 2025-05-07T19:42:58.3515431Z fpu : yes 2025-05-07T19:42:58.3515660Z fpu_exception : yes 2025-05-07T19:42:58.3515892Z cpuid level : 13 2025-05-07T19:42:58.3516123Z wp : yes 2025-05-07T19:42:58.3518530Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3521256Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3521874Z bogomips : 5999.98 2025-05-07T19:42:58.3522095Z clflush size : 64 2025-05-07T19:42:58.3522334Z cache_alignment : 64 2025-05-07T19:42:58.3522611Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3522965Z power management: 2025-05-07T19:42:58.3523100Z 2025-05-07T19:42:58.3523190Z processor : 24 2025-05-07T19:42:58.3523427Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3523692Z cpu family : 6 2025-05-07T19:42:58.3523903Z model : 85 2025-05-07T19:42:58.3524199Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3524560Z stepping : 7 2025-05-07T19:42:58.3524794Z microcode : 0x5003901 2025-05-07T19:42:58.3525139Z cpu MHz : 1269.604 2025-05-07T19:42:58.3525372Z cache size : 36608 KB 2025-05-07T19:42:58.3525587Z physical id : 1 2025-05-07T19:42:58.3525804Z siblings : 48 2025-05-07T19:42:58.3525994Z core id : 0 2025-05-07T19:42:58.3526192Z cpu cores : 24 2025-05-07T19:42:58.3526381Z apicid : 64 2025-05-07T19:42:58.3526586Z initial apicid : 64 2025-05-07T19:42:58.3526803Z fpu : yes 2025-05-07T19:42:58.3526992Z fpu_exception : yes 2025-05-07T19:42:58.3527214Z cpuid level : 13 2025-05-07T19:42:58.3527413Z wp : yes 2025-05-07T19:42:58.3529594Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3532544Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3533148Z bogomips : 5999.98 2025-05-07T19:42:58.3533457Z clflush size : 64 2025-05-07T19:42:58.3533687Z cache_alignment : 64 2025-05-07T19:42:58.3533984Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3534323Z power management: 2025-05-07T19:42:58.3534487Z 2025-05-07T19:42:58.3534576Z processor : 25 2025-05-07T19:42:58.3534824Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3535075Z cpu family : 6 2025-05-07T19:42:58.3535275Z model : 85 2025-05-07T19:42:58.3535579Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3535948Z stepping : 7 2025-05-07T19:42:58.3536182Z microcode : 0x5003901 2025-05-07T19:42:58.3536410Z cpu MHz : 1541.698 2025-05-07T19:42:58.3536646Z cache size : 36608 KB 2025-05-07T19:42:58.3536871Z physical id : 1 2025-05-07T19:42:58.3537091Z siblings : 48 2025-05-07T19:42:58.3537297Z core id : 1 2025-05-07T19:42:58.3537492Z cpu cores : 24 2025-05-07T19:42:58.3537708Z apicid : 66 2025-05-07T19:42:58.3537905Z initial apicid : 66 2025-05-07T19:42:58.3538138Z fpu : yes 2025-05-07T19:42:58.3538337Z fpu_exception : yes 2025-05-07T19:42:58.3538585Z cpuid level : 13 2025-05-07T19:42:58.3538792Z wp : yes 2025-05-07T19:42:58.3541273Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3544083Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3544757Z bogomips : 5999.98 2025-05-07T19:42:58.3544986Z clflush size : 64 2025-05-07T19:42:58.3545188Z cache_alignment : 64 2025-05-07T19:42:58.3545449Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3545759Z power management: 2025-05-07T19:42:58.3545880Z 2025-05-07T19:42:58.3545951Z processor : 26 2025-05-07T19:42:58.3546145Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3546354Z cpu family : 6 2025-05-07T19:42:58.3546545Z model : 85 2025-05-07T19:42:58.3546792Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3547118Z stepping : 7 2025-05-07T19:42:58.3547298Z microcode : 0x5003901 2025-05-07T19:42:58.3547509Z cpu MHz : 1546.195 2025-05-07T19:42:58.3547700Z cache size : 36608 KB 2025-05-07T19:42:58.3547910Z physical id : 1 2025-05-07T19:42:58.3548098Z siblings : 48 2025-05-07T19:42:58.3548270Z core id : 2 2025-05-07T19:42:58.3548457Z cpu cores : 24 2025-05-07T19:42:58.3548632Z apicid : 68 2025-05-07T19:42:58.3548818Z initial apicid : 68 2025-05-07T19:42:58.3549005Z fpu : yes 2025-05-07T19:42:58.3549198Z fpu_exception : yes 2025-05-07T19:42:58.3549394Z cpuid level : 13 2025-05-07T19:42:58.3549603Z wp : yes 2025-05-07T19:42:58.3551750Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3554689Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3555291Z bogomips : 5999.98 2025-05-07T19:42:58.3555508Z clflush size : 64 2025-05-07T19:42:58.3555747Z cache_alignment : 64 2025-05-07T19:42:58.3556039Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3556374Z power management: 2025-05-07T19:42:58.3556506Z 2025-05-07T19:42:58.3556604Z processor : 27 2025-05-07T19:42:58.3556832Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3557081Z cpu family : 6 2025-05-07T19:42:58.3557280Z model : 85 2025-05-07T19:42:58.3557564Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3557915Z stepping : 7 2025-05-07T19:42:58.3558150Z microcode : 0x5003901 2025-05-07T19:42:58.3558370Z cpu MHz : 2999.994 2025-05-07T19:42:58.3558606Z cache size : 36608 KB 2025-05-07T19:42:58.3558824Z physical id : 1 2025-05-07T19:42:58.3559030Z siblings : 48 2025-05-07T19:42:58.3559244Z core id : 3 2025-05-07T19:42:58.3559439Z cpu cores : 24 2025-05-07T19:42:58.3559656Z apicid : 70 2025-05-07T19:42:58.3559859Z initial apicid : 70 2025-05-07T19:42:58.3560077Z fpu : yes 2025-05-07T19:42:58.3560278Z fpu_exception : yes 2025-05-07T19:42:58.3560508Z cpuid level : 13 2025-05-07T19:42:58.3560709Z wp : yes 2025-05-07T19:42:58.3563091Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3565877Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3566419Z bogomips : 5999.98 2025-05-07T19:42:58.3566649Z clflush size : 64 2025-05-07T19:42:58.3566856Z cache_alignment : 64 2025-05-07T19:42:58.3567138Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3567456Z power management: 2025-05-07T19:42:58.3567577Z 2025-05-07T19:42:58.3567654Z processor : 28 2025-05-07T19:42:58.3567858Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3568071Z cpu family : 6 2025-05-07T19:42:58.3568265Z model : 85 2025-05-07T19:42:58.3568513Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3568864Z stepping : 7 2025-05-07T19:42:58.3569070Z microcode : 0x5003901 2025-05-07T19:42:58.3569286Z cpu MHz : 1200.646 2025-05-07T19:42:58.3569498Z cache size : 36608 KB 2025-05-07T19:42:58.3569713Z physical id : 1 2025-05-07T19:42:58.3569923Z siblings : 48 2025-05-07T19:42:58.3570112Z core id : 4 2025-05-07T19:42:58.3570307Z cpu cores : 24 2025-05-07T19:42:58.3570495Z apicid : 72 2025-05-07T19:42:58.3570701Z initial apicid : 72 2025-05-07T19:42:58.3570897Z fpu : yes 2025-05-07T19:42:58.3571098Z fpu_exception : yes 2025-05-07T19:42:58.3571296Z cpuid level : 13 2025-05-07T19:42:58.3571501Z wp : yes 2025-05-07T19:42:58.3573640Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3576174Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3576720Z bogomips : 5999.98 2025-05-07T19:42:58.3576910Z clflush size : 64 2025-05-07T19:42:58.3577116Z cache_alignment : 64 2025-05-07T19:42:58.3577370Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3577665Z power management: 2025-05-07T19:42:58.3577784Z 2025-05-07T19:42:58.3577873Z processor : 29 2025-05-07T19:42:58.3578068Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3578290Z cpu family : 6 2025-05-07T19:42:58.3578471Z model : 85 2025-05-07T19:42:58.3578736Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3579058Z stepping : 7 2025-05-07T19:42:58.3579257Z microcode : 0x5003901 2025-05-07T19:42:58.3579466Z cpu MHz : 1200.801 2025-05-07T19:42:58.3579675Z cache size : 36608 KB 2025-05-07T19:42:58.3579888Z physical id : 1 2025-05-07T19:42:58.3580098Z siblings : 48 2025-05-07T19:42:58.3580296Z core id : 5 2025-05-07T19:42:58.3580489Z cpu cores : 24 2025-05-07T19:42:58.3580745Z apicid : 74 2025-05-07T19:42:58.3580969Z initial apicid : 74 2025-05-07T19:42:58.3581239Z fpu : yes 2025-05-07T19:42:58.3581451Z fpu_exception : yes 2025-05-07T19:42:58.3581721Z cpuid level : 13 2025-05-07T19:42:58.3581956Z wp : yes 2025-05-07T19:42:58.3584260Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3586818Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3587403Z bogomips : 5999.98 2025-05-07T19:42:58.3587666Z clflush size : 64 2025-05-07T19:42:58.3587899Z cache_alignment : 64 2025-05-07T19:42:58.3588196Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3588551Z power management: 2025-05-07T19:42:58.3588689Z 2025-05-07T19:42:58.3588778Z processor : 30 2025-05-07T19:42:58.3589036Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3589286Z cpu family : 6 2025-05-07T19:42:58.3589526Z model : 85 2025-05-07T19:42:58.3589802Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3590174Z stepping : 7 2025-05-07T19:42:58.3590389Z microcode : 0x5003901 2025-05-07T19:42:58.3590657Z cpu MHz : 2999.994 2025-05-07T19:42:58.3590887Z cache size : 36608 KB 2025-05-07T19:42:58.3591154Z physical id : 1 2025-05-07T19:42:58.3591410Z siblings : 48 2025-05-07T19:42:58.3591639Z core id : 6 2025-05-07T19:42:58.3591879Z cpu cores : 24 2025-05-07T19:42:58.3592096Z apicid : 76 2025-05-07T19:42:58.3592344Z initial apicid : 76 2025-05-07T19:42:58.3592656Z fpu : yes 2025-05-07T19:42:58.3593064Z fpu_exception : yes 2025-05-07T19:42:58.3593308Z cpuid level : 13 2025-05-07T19:42:58.3593588Z wp : yes 2025-05-07T19:42:58.3595913Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3598724Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3599382Z bogomips : 5999.98 2025-05-07T19:42:58.3599637Z clflush size : 64 2025-05-07T19:42:58.3599930Z cache_alignment : 64 2025-05-07T19:42:58.3600274Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3600642Z power management: 2025-05-07T19:42:58.3600796Z 2025-05-07T19:42:58.3600932Z processor : 31 2025-05-07T19:42:58.3601173Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3601459Z cpu family : 6 2025-05-07T19:42:58.3601683Z model : 85 2025-05-07T19:42:58.3602009Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3602383Z stepping : 7 2025-05-07T19:42:58.3602643Z microcode : 0x5003901 2025-05-07T19:42:58.3602894Z cpu MHz : 1239.440 2025-05-07T19:42:58.3603156Z cache size : 36608 KB 2025-05-07T19:42:58.3603404Z physical id : 1 2025-05-07T19:42:58.3603671Z siblings : 48 2025-05-07T19:42:58.3603921Z core id : 7 2025-05-07T19:42:58.3604147Z cpu cores : 24 2025-05-07T19:42:58.3604407Z apicid : 78 2025-05-07T19:42:58.3604632Z initial apicid : 78 2025-05-07T19:42:58.3604901Z fpu : yes 2025-05-07T19:42:58.3605227Z fpu_exception : yes 2025-05-07T19:42:58.3605481Z cpuid level : 13 2025-05-07T19:42:58.3605703Z wp : yes 2025-05-07T19:42:58.3607941Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3610489Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3611067Z bogomips : 5999.98 2025-05-07T19:42:58.3611312Z clflush size : 64 2025-05-07T19:42:58.3611539Z cache_alignment : 64 2025-05-07T19:42:58.3611845Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3612209Z power management: 2025-05-07T19:42:58.3612345Z 2025-05-07T19:42:58.3612438Z processor : 32 2025-05-07T19:42:58.3612695Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3612946Z cpu family : 6 2025-05-07T19:42:58.3613185Z model : 85 2025-05-07T19:42:58.3613471Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3613857Z stepping : 7 2025-05-07T19:42:58.3614078Z microcode : 0x5003901 2025-05-07T19:42:58.3614354Z cpu MHz : 2999.994 2025-05-07T19:42:58.3614592Z cache size : 36608 KB 2025-05-07T19:42:58.3614874Z physical id : 1 2025-05-07T19:42:58.3615129Z siblings : 48 2025-05-07T19:42:58.3615350Z core id : 8 2025-05-07T19:42:58.3615586Z cpu cores : 24 2025-05-07T19:42:58.3615808Z apicid : 80 2025-05-07T19:42:58.3616054Z initial apicid : 80 2025-05-07T19:42:58.3616288Z fpu : yes 2025-05-07T19:42:58.3616514Z fpu_exception : yes 2025-05-07T19:42:58.3616743Z cpuid level : 13 2025-05-07T19:42:58.3616998Z wp : yes 2025-05-07T19:42:58.3619149Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3621635Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3622241Z bogomips : 5999.98 2025-05-07T19:42:58.3622452Z clflush size : 64 2025-05-07T19:42:58.3622674Z cache_alignment : 64 2025-05-07T19:42:58.3622961Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3623281Z power management: 2025-05-07T19:42:58.3623413Z 2025-05-07T19:42:58.3623513Z processor : 33 2025-05-07T19:42:58.3623717Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3623952Z cpu family : 6 2025-05-07T19:42:58.3624155Z model : 85 2025-05-07T19:42:58.3624433Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3624773Z stepping : 7 2025-05-07T19:42:58.3624991Z microcode : 0x5003901 2025-05-07T19:42:58.3625211Z cpu MHz : 2999.994 2025-05-07T19:42:58.3625444Z cache size : 36608 KB 2025-05-07T19:42:58.3625677Z physical id : 1 2025-05-07T19:42:58.3625891Z siblings : 48 2025-05-07T19:42:58.3626100Z core id : 9 2025-05-07T19:42:58.3626295Z cpu cores : 24 2025-05-07T19:42:58.3626515Z apicid : 82 2025-05-07T19:42:58.3626712Z initial apicid : 82 2025-05-07T19:42:58.3626948Z fpu : yes 2025-05-07T19:42:58.3627133Z fpu_exception : yes 2025-05-07T19:42:58.3627796Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:42:58.3628101Z cpuid level : 13 2025-05-07T19:42:58.3628319Z wp : yes 2025-05-07T19:42:58.3630471Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3633622Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3634238Z bogomips : 5999.98 2025-05-07T19:42:58.3634472Z clflush size : 64 2025-05-07T19:42:58.3634702Z cache_alignment : 64 2025-05-07T19:42:58.3634991Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3635325Z power management: 2025-05-07T19:42:58.3635467Z 2025-05-07T19:42:58.3635558Z processor : 34 2025-05-07T19:42:58.3635773Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3636033Z cpu family : 6 2025-05-07T19:42:58.3636243Z model : 85 2025-05-07T19:42:58.3636534Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3636906Z stepping : 7 2025-05-07T19:42:58.3637135Z microcode : 0x5003901 2025-05-07T19:42:58.3637393Z cpu MHz : 2999.994 2025-05-07T19:42:58.3637612Z cache size : 36608 KB 2025-05-07T19:42:58.3637844Z physical id : 1 2025-05-07T19:42:58.3638063Z siblings : 48 2025-05-07T19:42:58.3638284Z core id : 10 2025-05-07T19:42:58.3638492Z cpu cores : 24 2025-05-07T19:42:58.3638704Z apicid : 84 2025-05-07T19:42:58.3638916Z initial apicid : 84 2025-05-07T19:42:58.3639162Z fpu : yes 2025-05-07T19:42:58.3639371Z fpu_exception : yes 2025-05-07T19:42:58.3639484Z cpuid level : 13 2025-05-07T19:42:58.3639575Z wp : yes 2025-05-07T19:42:58.3641793Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3642219Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3644236Z bogomips : 5999.98 2025-05-07T19:42:58.3644331Z clflush size : 64 2025-05-07T19:42:58.3644586Z cache_alignment : 64 2025-05-07T19:42:58.3644714Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3644803Z power management: 2025-05-07T19:42:58.3644807Z 2025-05-07T19:42:58.3644911Z processor : 35 2025-05-07T19:42:58.3645003Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3645084Z cpu family : 6 2025-05-07T19:42:58.3645159Z model : 85 2025-05-07T19:42:58.3645330Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3645408Z stepping : 7 2025-05-07T19:42:58.3645492Z microcode : 0x5003901 2025-05-07T19:42:58.3645590Z cpu MHz : 2999.994 2025-05-07T19:42:58.3645669Z cache size : 36608 KB 2025-05-07T19:42:58.3645751Z physical id : 1 2025-05-07T19:42:58.3645832Z siblings : 48 2025-05-07T19:42:58.3645921Z core id : 11 2025-05-07T19:42:58.3646002Z cpu cores : 24 2025-05-07T19:42:58.3646092Z apicid : 86 2025-05-07T19:42:58.3646182Z initial apicid : 86 2025-05-07T19:42:58.3646283Z fpu : yes 2025-05-07T19:42:58.3646368Z fpu_exception : yes 2025-05-07T19:42:58.3646446Z cpuid level : 13 2025-05-07T19:42:58.3646540Z wp : yes 2025-05-07T19:42:58.3648595Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3649022Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3649129Z bogomips : 5999.98 2025-05-07T19:42:58.3649210Z clflush size : 64 2025-05-07T19:42:58.3649298Z cache_alignment : 64 2025-05-07T19:42:58.3649447Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3649533Z power management: 2025-05-07T19:42:58.3649537Z 2025-05-07T19:42:58.3649614Z processor : 36 2025-05-07T19:42:58.3649715Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3649805Z cpu family : 6 2025-05-07T19:42:58.3649878Z model : 85 2025-05-07T19:42:58.3650035Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3650134Z stepping : 7 2025-05-07T19:42:58.3650224Z microcode : 0x5003901 2025-05-07T19:42:58.3650301Z cpu MHz : 2999.994 2025-05-07T19:42:58.3650401Z cache size : 36608 KB 2025-05-07T19:42:58.3650479Z physical id : 1 2025-05-07T19:42:58.3650551Z siblings : 48 2025-05-07T19:42:58.3650630Z core id : 12 2025-05-07T19:42:58.3650736Z cpu cores : 24 2025-05-07T19:42:58.3650820Z apicid : 88 2025-05-07T19:42:58.3650901Z initial apicid : 88 2025-05-07T19:42:58.3650976Z fpu : yes 2025-05-07T19:42:58.3651079Z fpu_exception : yes 2025-05-07T19:42:58.3651156Z cpuid level : 13 2025-05-07T19:42:58.3651229Z wp : yes 2025-05-07T19:42:58.3653299Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3653667Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3653746Z bogomips : 5999.98 2025-05-07T19:42:58.3653840Z clflush size : 64 2025-05-07T19:42:58.3653971Z cache_alignment : 64 2025-05-07T19:42:58.3654104Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3654200Z power management: 2025-05-07T19:42:58.3654204Z 2025-05-07T19:42:58.3654286Z processor : 37 2025-05-07T19:42:58.3654376Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3654458Z cpu family : 6 2025-05-07T19:42:58.3654552Z model : 85 2025-05-07T19:42:58.3654708Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3654788Z stepping : 7 2025-05-07T19:42:58.3654887Z microcode : 0x5003901 2025-05-07T19:42:58.3654970Z cpu MHz : 2999.994 2025-05-07T19:42:58.3655055Z cache size : 36608 KB 2025-05-07T19:42:58.3655137Z physical id : 1 2025-05-07T19:42:58.3655222Z siblings : 48 2025-05-07T19:42:58.3655301Z core id : 13 2025-05-07T19:42:58.3655384Z cpu cores : 24 2025-05-07T19:42:58.3655478Z apicid : 90 2025-05-07T19:42:58.3655559Z initial apicid : 90 2025-05-07T19:42:58.3655640Z fpu : yes 2025-05-07T19:42:58.3655721Z fpu_exception : yes 2025-05-07T19:42:58.3655821Z cpuid level : 13 2025-05-07T19:42:58.3655895Z wp : yes 2025-05-07T19:42:58.3657947Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3658335Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3658412Z bogomips : 5999.98 2025-05-07T19:42:58.3658538Z clflush size : 64 2025-05-07T19:42:58.3658628Z cache_alignment : 64 2025-05-07T19:42:58.3658749Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3658824Z power management: 2025-05-07T19:42:58.3658828Z 2025-05-07T19:42:58.3658912Z processor : 38 2025-05-07T19:42:58.3658992Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3659062Z cpu family : 6 2025-05-07T19:42:58.3659134Z model : 85 2025-05-07T19:42:58.3659298Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3659377Z stepping : 7 2025-05-07T19:42:58.3659457Z microcode : 0x5003901 2025-05-07T19:42:58.3659538Z cpu MHz : 2999.994 2025-05-07T19:42:58.3659610Z cache size : 36608 KB 2025-05-07T19:42:58.3659685Z physical id : 1 2025-05-07T19:42:58.3659761Z siblings : 48 2025-05-07T19:42:58.3659843Z core id : 14 2025-05-07T19:42:58.3659915Z cpu cores : 24 2025-05-07T19:42:58.3659990Z apicid : 92 2025-05-07T19:42:58.3660079Z initial apicid : 92 2025-05-07T19:42:58.3660150Z fpu : yes 2025-05-07T19:42:58.3660229Z fpu_exception : yes 2025-05-07T19:42:58.3660307Z cpuid level : 13 2025-05-07T19:42:58.3660409Z wp : yes 2025-05-07T19:42:58.3662450Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3662830Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3662909Z bogomips : 5999.98 2025-05-07T19:42:58.3662982Z clflush size : 64 2025-05-07T19:42:58.3663065Z cache_alignment : 64 2025-05-07T19:42:58.3663197Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3663325Z power management: 2025-05-07T19:42:58.3663329Z 2025-05-07T19:42:58.3663400Z processor : 39 2025-05-07T19:42:58.3663492Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3663566Z cpu family : 6 2025-05-07T19:42:58.3663638Z model : 85 2025-05-07T19:42:58.3663789Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3663890Z stepping : 7 2025-05-07T19:42:58.3663979Z microcode : 0x5003901 2025-05-07T19:42:58.3664055Z cpu MHz : 2999.994 2025-05-07T19:42:58.3664144Z cache size : 36608 KB 2025-05-07T19:42:58.3664230Z physical id : 1 2025-05-07T19:42:58.3664318Z siblings : 48 2025-05-07T19:42:58.3664392Z core id : 15 2025-05-07T19:42:58.3664488Z cpu cores : 24 2025-05-07T19:42:58.3664565Z apicid : 94 2025-05-07T19:42:58.3664647Z initial apicid : 94 2025-05-07T19:42:58.3664739Z fpu : yes 2025-05-07T19:42:58.3664819Z fpu_exception : yes 2025-05-07T19:42:58.3664907Z cpuid level : 13 2025-05-07T19:42:58.3664984Z wp : yes 2025-05-07T19:42:58.3667049Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3667413Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3667520Z bogomips : 5999.98 2025-05-07T19:42:58.3667599Z clflush size : 64 2025-05-07T19:42:58.3667695Z cache_alignment : 64 2025-05-07T19:42:58.3667868Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3667980Z power management: 2025-05-07T19:42:58.3667989Z 2025-05-07T19:42:58.3668074Z processor : 40 2025-05-07T19:42:58.3668163Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3668243Z cpu family : 6 2025-05-07T19:42:58.3668324Z model : 85 2025-05-07T19:42:58.3668472Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3668545Z stepping : 7 2025-05-07T19:42:58.3668629Z microcode : 0x5003901 2025-05-07T19:42:58.3668704Z cpu MHz : 2999.994 2025-05-07T19:42:58.3668776Z cache size : 36608 KB 2025-05-07T19:42:58.3668857Z physical id : 1 2025-05-07T19:42:58.3668931Z siblings : 48 2025-05-07T19:42:58.3668999Z core id : 16 2025-05-07T19:42:58.3669072Z cpu cores : 24 2025-05-07T19:42:58.3669152Z apicid : 96 2025-05-07T19:42:58.3669230Z initial apicid : 96 2025-05-07T19:42:58.3669302Z fpu : yes 2025-05-07T19:42:58.3669387Z fpu_exception : yes 2025-05-07T19:42:58.3669461Z cpuid level : 13 2025-05-07T19:42:58.3669535Z wp : yes 2025-05-07T19:42:58.3671559Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3671927Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3672003Z bogomips : 5999.98 2025-05-07T19:42:58.3672092Z clflush size : 64 2025-05-07T19:42:58.3672169Z cache_alignment : 64 2025-05-07T19:42:58.3672288Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3672368Z power management: 2025-05-07T19:42:58.3672372Z 2025-05-07T19:42:58.3672594Z processor : 41 2025-05-07T19:42:58.3672678Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3672749Z cpu family : 6 2025-05-07T19:42:58.3672835Z model : 85 2025-05-07T19:42:58.3673156Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3673233Z stepping : 7 2025-05-07T19:42:58.3673315Z microcode : 0x5003901 2025-05-07T19:42:58.3673416Z cpu MHz : 2999.994 2025-05-07T19:42:58.3673502Z cache size : 36608 KB 2025-05-07T19:42:58.3673577Z physical id : 1 2025-05-07T19:42:58.3673675Z siblings : 48 2025-05-07T19:42:58.3673807Z core id : 17 2025-05-07T19:42:58.3673889Z cpu cores : 24 2025-05-07T19:42:58.3673969Z apicid : 98 2025-05-07T19:42:58.3674070Z initial apicid : 98 2025-05-07T19:42:58.3674154Z fpu : yes 2025-05-07T19:42:58.3674241Z fpu_exception : yes 2025-05-07T19:42:58.3674324Z cpuid level : 13 2025-05-07T19:42:58.3674422Z wp : yes 2025-05-07T19:42:58.3676623Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3677040Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3677129Z bogomips : 5999.98 2025-05-07T19:42:58.3677215Z clflush size : 64 2025-05-07T19:42:58.3677317Z cache_alignment : 64 2025-05-07T19:42:58.3677452Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3677539Z power management: 2025-05-07T19:42:58.3677597Z 2025-05-07T19:42:58.3677681Z processor : 42 2025-05-07T19:42:58.3677806Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3677896Z cpu family : 6 2025-05-07T19:42:58.3677974Z model : 85 2025-05-07T19:42:58.3678161Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3678248Z stepping : 7 2025-05-07T19:42:58.3678338Z microcode : 0x5003901 2025-05-07T19:42:58.3678419Z cpu MHz : 2999.994 2025-05-07T19:42:58.3678524Z cache size : 36608 KB 2025-05-07T19:42:58.3678615Z physical id : 1 2025-05-07T19:42:58.3678696Z siblings : 48 2025-05-07T19:42:58.3678798Z core id : 18 2025-05-07T19:42:58.3678889Z cpu cores : 24 2025-05-07T19:42:58.3678972Z apicid : 100 2025-05-07T19:42:58.3679059Z initial apicid : 100 2025-05-07T19:42:58.3679156Z fpu : yes 2025-05-07T19:42:58.3679251Z fpu_exception : yes 2025-05-07T19:42:58.3679332Z cpuid level : 13 2025-05-07T19:42:58.3679410Z wp : yes 2025-05-07T19:42:58.3681655Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3682063Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3682167Z bogomips : 5999.98 2025-05-07T19:42:58.3682257Z clflush size : 64 2025-05-07T19:42:58.3682347Z cache_alignment : 64 2025-05-07T19:42:58.3682487Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3682585Z power management: 2025-05-07T19:42:58.3682589Z 2025-05-07T19:42:58.3682687Z processor : 43 2025-05-07T19:42:58.3682780Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3682925Z cpu family : 6 2025-05-07T19:42:58.3683006Z model : 85 2025-05-07T19:42:58.3683173Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3683275Z stepping : 7 2025-05-07T19:42:58.3683362Z microcode : 0x5003901 2025-05-07T19:42:58.3683454Z cpu MHz : 2999.994 2025-05-07T19:42:58.3683542Z cache size : 36608 KB 2025-05-07T19:42:58.3683647Z physical id : 1 2025-05-07T19:42:58.3683728Z siblings : 48 2025-05-07T19:42:58.3683814Z core id : 19 2025-05-07T19:42:58.3683900Z cpu cores : 24 2025-05-07T19:42:58.3683997Z apicid : 102 2025-05-07T19:42:58.3684083Z initial apicid : 102 2025-05-07T19:42:58.3684166Z fpu : yes 2025-05-07T19:42:58.3684263Z fpu_exception : yes 2025-05-07T19:42:58.3684340Z cpuid level : 13 2025-05-07T19:42:58.3684416Z wp : yes 2025-05-07T19:42:58.3686632Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3686998Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3687075Z bogomips : 5999.98 2025-05-07T19:42:58.3687176Z clflush size : 64 2025-05-07T19:42:58.3687264Z cache_alignment : 64 2025-05-07T19:42:58.3687385Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3687468Z power management: 2025-05-07T19:42:58.3687493Z 2025-05-07T19:42:58.3687563Z processor : 44 2025-05-07T19:42:58.3687702Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3687785Z cpu family : 6 2025-05-07T19:42:58.3687882Z model : 85 2025-05-07T19:42:58.3688034Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3688117Z stepping : 7 2025-05-07T19:42:58.3688203Z microcode : 0x5003901 2025-05-07T19:42:58.3688305Z cpu MHz : 2999.994 2025-05-07T19:42:58.3688393Z cache size : 36608 KB 2025-05-07T19:42:58.3688474Z physical id : 1 2025-05-07T19:42:58.3688579Z siblings : 48 2025-05-07T19:42:58.3688658Z core id : 20 2025-05-07T19:42:58.3688743Z cpu cores : 24 2025-05-07T19:42:58.3688821Z apicid : 104 2025-05-07T19:42:58.3688915Z initial apicid : 104 2025-05-07T19:42:58.3688988Z fpu : yes 2025-05-07T19:42:58.3689074Z fpu_exception : yes 2025-05-07T19:42:58.3689182Z cpuid level : 13 2025-05-07T19:42:58.3689253Z wp : yes 2025-05-07T19:42:58.3691523Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3691928Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3692022Z bogomips : 5999.98 2025-05-07T19:42:58.3692104Z clflush size : 64 2025-05-07T19:42:58.3692207Z cache_alignment : 64 2025-05-07T19:42:58.3692342Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3692430Z power management: 2025-05-07T19:42:58.3692435Z 2025-05-07T19:42:58.3692517Z processor : 45 2025-05-07T19:42:58.3692626Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3692723Z cpu family : 6 2025-05-07T19:42:58.3692802Z model : 85 2025-05-07T19:42:58.3692973Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3693117Z stepping : 7 2025-05-07T19:42:58.3693209Z microcode : 0x5003901 2025-05-07T19:42:58.3693290Z cpu MHz : 2999.994 2025-05-07T19:42:58.3693390Z cache size : 36608 KB 2025-05-07T19:42:58.3693474Z physical id : 1 2025-05-07T19:42:58.3693560Z siblings : 48 2025-05-07T19:42:58.3693647Z core id : 21 2025-05-07T19:42:58.3693730Z cpu cores : 24 2025-05-07T19:42:58.3693811Z apicid : 106 2025-05-07T19:42:58.3693901Z initial apicid : 106 2025-05-07T19:42:58.3693998Z fpu : yes 2025-05-07T19:42:58.3694093Z fpu_exception : yes 2025-05-07T19:42:58.3694185Z cpuid level : 13 2025-05-07T19:42:58.3694274Z wp : yes 2025-05-07T19:42:58.3696436Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3696828Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3696945Z bogomips : 5999.98 2025-05-07T19:42:58.3697035Z clflush size : 64 2025-05-07T19:42:58.3697129Z cache_alignment : 64 2025-05-07T19:42:58.3697291Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3697372Z power management: 2025-05-07T19:42:58.3697376Z 2025-05-07T19:42:58.3697458Z processor : 46 2025-05-07T19:42:58.3697557Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3697671Z cpu family : 6 2025-05-07T19:42:58.3697755Z model : 85 2025-05-07T19:42:58.3697966Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3698074Z stepping : 7 2025-05-07T19:42:58.3698169Z microcode : 0x5003901 2025-05-07T19:42:58.3698263Z cpu MHz : 2999.994 2025-05-07T19:42:58.3698351Z cache size : 36608 KB 2025-05-07T19:42:58.3698468Z physical id : 1 2025-05-07T19:42:58.3698544Z siblings : 48 2025-05-07T19:42:58.3698638Z core id : 22 2025-05-07T19:42:58.3698747Z cpu cores : 24 2025-05-07T19:42:58.3698830Z apicid : 108 2025-05-07T19:42:58.3698929Z initial apicid : 108 2025-05-07T19:42:58.3699000Z fpu : yes 2025-05-07T19:42:58.3699116Z fpu_exception : yes 2025-05-07T19:42:58.3699212Z cpuid level : 13 2025-05-07T19:42:58.3699299Z wp : yes 2025-05-07T19:42:58.3701594Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3701965Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3702043Z bogomips : 5999.98 2025-05-07T19:42:58.3702128Z clflush size : 64 2025-05-07T19:42:58.3702211Z cache_alignment : 64 2025-05-07T19:42:58.3702348Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3702460Z power management: 2025-05-07T19:42:58.3702464Z 2025-05-07T19:42:58.3702549Z processor : 47 2025-05-07T19:42:58.3702643Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3702731Z cpu family : 6 2025-05-07T19:42:58.3702831Z model : 85 2025-05-07T19:42:58.3702988Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3703060Z stepping : 7 2025-05-07T19:42:58.3703193Z microcode : 0x5003901 2025-05-07T19:42:58.3703265Z cpu MHz : 2999.994 2025-05-07T19:42:58.3703352Z cache size : 36608 KB 2025-05-07T19:42:58.3703436Z physical id : 1 2025-05-07T19:42:58.3703514Z siblings : 48 2025-05-07T19:42:58.3703585Z core id : 23 2025-05-07T19:42:58.3703653Z cpu cores : 24 2025-05-07T19:42:58.3703731Z apicid : 110 2025-05-07T19:42:58.3703810Z initial apicid : 110 2025-05-07T19:42:58.3703878Z fpu : yes 2025-05-07T19:42:58.3703963Z fpu_exception : yes 2025-05-07T19:42:58.3704051Z cpuid level : 13 2025-05-07T19:42:58.3704135Z wp : yes 2025-05-07T19:42:58.3706190Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3706586Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3706673Z bogomips : 5999.98 2025-05-07T19:42:58.3706760Z clflush size : 64 2025-05-07T19:42:58.3706855Z cache_alignment : 64 2025-05-07T19:42:58.3706981Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3707074Z power management: 2025-05-07T19:42:58.3707078Z 2025-05-07T19:42:58.3707170Z processor : 48 2025-05-07T19:42:58.3707258Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3707344Z cpu family : 6 2025-05-07T19:42:58.3707428Z model : 85 2025-05-07T19:42:58.3707605Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3707731Z stepping : 7 2025-05-07T19:42:58.3707807Z microcode : 0x5003901 2025-05-07T19:42:58.3707916Z cpu MHz : 3179.932 2025-05-07T19:42:58.3708004Z cache size : 36608 KB 2025-05-07T19:42:58.3708083Z physical id : 0 2025-05-07T19:42:58.3708157Z siblings : 48 2025-05-07T19:42:58.3708246Z core id : 0 2025-05-07T19:42:58.3708334Z cpu cores : 24 2025-05-07T19:42:58.3708411Z apicid : 1 2025-05-07T19:42:58.3708500Z initial apicid : 1 2025-05-07T19:42:58.3708584Z fpu : yes 2025-05-07T19:42:58.3708674Z fpu_exception : yes 2025-05-07T19:42:58.3708761Z cpuid level : 13 2025-05-07T19:42:58.3708844Z wp : yes 2025-05-07T19:42:58.3710887Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3711262Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3711358Z bogomips : 5999.98 2025-05-07T19:42:58.3711450Z clflush size : 64 2025-05-07T19:42:58.3711535Z cache_alignment : 64 2025-05-07T19:42:58.3711689Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3711763Z power management: 2025-05-07T19:42:58.3711767Z 2025-05-07T19:42:58.3711852Z processor : 49 2025-05-07T19:42:58.3711960Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3712042Z cpu family : 6 2025-05-07T19:42:58.3712118Z model : 85 2025-05-07T19:42:58.3712263Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3712360Z stepping : 7 2025-05-07T19:42:58.3712501Z microcode : 0x5003901 2025-05-07T19:42:58.3712584Z cpu MHz : 2999.994 2025-05-07T19:42:58.3712733Z cache size : 36608 KB 2025-05-07T19:42:58.3712819Z physical id : 0 2025-05-07T19:42:58.3713053Z siblings : 48 2025-05-07T19:42:58.3713144Z core id : 1 2025-05-07T19:42:58.3713246Z cpu cores : 24 2025-05-07T19:42:58.3713330Z apicid : 3 2025-05-07T19:42:58.3713421Z initial apicid : 3 2025-05-07T19:42:58.3713513Z fpu : yes 2025-05-07T19:42:58.3713640Z fpu_exception : yes 2025-05-07T19:42:58.3713728Z cpuid level : 13 2025-05-07T19:42:58.3713886Z wp : yes 2025-05-07T19:42:58.3716150Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3716554Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3716644Z bogomips : 5999.98 2025-05-07T19:42:58.3716761Z clflush size : 64 2025-05-07T19:42:58.3716843Z cache_alignment : 64 2025-05-07T19:42:58.3716985Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3717094Z power management: 2025-05-07T19:42:58.3717098Z 2025-05-07T19:42:58.3717187Z processor : 50 2025-05-07T19:42:58.3717276Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3717358Z cpu family : 6 2025-05-07T19:42:58.3717432Z model : 85 2025-05-07T19:42:58.3717593Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3717669Z stepping : 7 2025-05-07T19:42:58.3717770Z microcode : 0x5003901 2025-05-07T19:42:58.3717894Z cpu MHz : 2999.994 2025-05-07T19:42:58.3717972Z cache size : 36608 KB 2025-05-07T19:42:58.3718053Z physical id : 0 2025-05-07T19:42:58.3718154Z siblings : 48 2025-05-07T19:42:58.3718238Z core id : 2 2025-05-07T19:42:58.3718325Z cpu cores : 24 2025-05-07T19:42:58.3718439Z apicid : 5 2025-05-07T19:42:58.3718538Z initial apicid : 5 2025-05-07T19:42:58.3718619Z fpu : yes 2025-05-07T19:42:58.3718716Z fpu_exception : yes 2025-05-07T19:42:58.3718826Z cpuid level : 13 2025-05-07T19:42:58.3718904Z wp : yes 2025-05-07T19:42:58.3721133Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3721543Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3721637Z bogomips : 5999.98 2025-05-07T19:42:58.3721730Z clflush size : 64 2025-05-07T19:42:58.3721841Z cache_alignment : 64 2025-05-07T19:42:58.3721971Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3722049Z power management: 2025-05-07T19:42:58.3722053Z 2025-05-07T19:42:58.3722170Z processor : 51 2025-05-07T19:42:58.3722257Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3722338Z cpu family : 6 2025-05-07T19:42:58.3722417Z model : 85 2025-05-07T19:42:58.3722607Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3722700Z stepping : 7 2025-05-07T19:42:58.3722789Z microcode : 0x5003901 2025-05-07T19:42:58.3722893Z cpu MHz : 3226.772 2025-05-07T19:42:58.3722990Z cache size : 36608 KB 2025-05-07T19:42:58.3723084Z physical id : 0 2025-05-07T19:42:58.3723214Z siblings : 48 2025-05-07T19:42:58.3723312Z core id : 3 2025-05-07T19:42:58.3723403Z cpu cores : 24 2025-05-07T19:42:58.3723489Z apicid : 7 2025-05-07T19:42:58.3723596Z initial apicid : 7 2025-05-07T19:42:58.3723681Z fpu : yes 2025-05-07T19:42:58.3723777Z fpu_exception : yes 2025-05-07T19:42:58.3723869Z cpuid level : 13 2025-05-07T19:42:58.3723973Z wp : yes 2025-05-07T19:42:58.3726224Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3726617Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3726701Z bogomips : 5999.98 2025-05-07T19:42:58.3726787Z clflush size : 64 2025-05-07T19:42:58.3726877Z cache_alignment : 64 2025-05-07T19:42:58.3727024Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3727111Z power management: 2025-05-07T19:42:58.3727115Z 2025-05-07T19:42:58.3727199Z processor : 52 2025-05-07T19:42:58.3727318Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3727396Z cpu family : 6 2025-05-07T19:42:58.3727477Z model : 85 2025-05-07T19:42:58.3727633Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3727745Z stepping : 7 2025-05-07T19:42:58.3727826Z microcode : 0x5003901 2025-05-07T19:42:58.3727910Z cpu MHz : 3236.958 2025-05-07T19:42:58.3728026Z cache size : 36608 KB 2025-05-07T19:42:58.3728149Z physical id : 0 2025-05-07T19:42:58.3728230Z siblings : 48 2025-05-07T19:42:58.3728311Z core id : 4 2025-05-07T19:42:58.3728402Z cpu cores : 24 2025-05-07T19:42:58.3728482Z apicid : 9 2025-05-07T19:42:58.3728572Z initial apicid : 9 2025-05-07T19:42:58.3728676Z fpu : yes 2025-05-07T19:42:58.3728767Z fpu_exception : yes 2025-05-07T19:42:58.3728857Z cpuid level : 13 2025-05-07T19:42:58.3728937Z wp : yes 2025-05-07T19:42:58.3731018Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3731522Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3731808Z bogomips : 5999.98 2025-05-07T19:42:58.3731894Z clflush size : 64 2025-05-07T19:42:58.3731989Z cache_alignment : 64 2025-05-07T19:42:58.3732124Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3732244Z power management: 2025-05-07T19:42:58.3732248Z 2025-05-07T19:42:58.3732418Z processor : 53 2025-05-07T19:42:58.3732515Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3732623Z cpu family : 6 2025-05-07T19:42:58.3732699Z model : 85 2025-05-07T19:42:58.3732865Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3732948Z stepping : 7 2025-05-07T19:42:58.3733067Z microcode : 0x5003901 2025-05-07T19:42:58.3733147Z cpu MHz : 3227.893 2025-05-07T19:42:58.3733231Z cache size : 36608 KB 2025-05-07T19:42:58.3733343Z physical id : 0 2025-05-07T19:42:58.3733440Z siblings : 48 2025-05-07T19:42:58.3733530Z core id : 5 2025-05-07T19:42:58.3733618Z cpu cores : 24 2025-05-07T19:42:58.3733832Z apicid : 11 2025-05-07T19:42:58.3733926Z initial apicid : 11 2025-05-07T19:42:58.3734020Z fpu : yes 2025-05-07T19:42:58.3734147Z fpu_exception : yes 2025-05-07T19:42:58.3734241Z cpuid level : 13 2025-05-07T19:42:58.3734324Z wp : yes 2025-05-07T19:42:58.3736537Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3736977Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3737079Z bogomips : 5999.98 2025-05-07T19:42:58.3737189Z clflush size : 64 2025-05-07T19:42:58.3737287Z cache_alignment : 64 2025-05-07T19:42:58.3737435Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3737536Z power management: 2025-05-07T19:42:58.3737541Z 2025-05-07T19:42:58.3737650Z processor : 54 2025-05-07T19:42:58.3737755Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3737849Z cpu family : 6 2025-05-07T19:42:58.3737958Z model : 85 2025-05-07T19:42:58.3738127Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3738219Z stepping : 7 2025-05-07T19:42:58.3738315Z microcode : 0x5003901 2025-05-07T19:42:58.3738418Z cpu MHz : 2999.994 2025-05-07T19:42:58.3738516Z cache size : 36608 KB 2025-05-07T19:42:58.3738612Z physical id : 0 2025-05-07T19:42:58.3738722Z siblings : 48 2025-05-07T19:42:58.3738819Z core id : 6 2025-05-07T19:42:58.3738982Z cpu cores : 24 2025-05-07T19:42:58.3739067Z apicid : 13 2025-05-07T19:42:58.3739177Z initial apicid : 13 2025-05-07T19:42:58.3739269Z fpu : yes 2025-05-07T19:42:58.3739368Z fpu_exception : yes 2025-05-07T19:42:58.3739455Z cpuid level : 13 2025-05-07T19:42:58.3739570Z wp : yes 2025-05-07T19:42:58.3741793Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3742212Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3742307Z bogomips : 5999.98 2025-05-07T19:42:58.3742397Z clflush size : 64 2025-05-07T19:42:58.3742508Z cache_alignment : 64 2025-05-07T19:42:58.3742651Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3742751Z power management: 2025-05-07T19:42:58.3742756Z 2025-05-07T19:42:58.3742847Z processor : 55 2025-05-07T19:42:58.3742961Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3743056Z cpu family : 6 2025-05-07T19:42:58.3743144Z model : 85 2025-05-07T19:42:58.3743340Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3743429Z stepping : 7 2025-05-07T19:42:58.3743522Z microcode : 0x5003901 2025-05-07T19:42:58.3743614Z cpu MHz : 3218.200 2025-05-07T19:42:58.3743722Z cache size : 36608 KB 2025-05-07T19:42:58.3743807Z physical id : 0 2025-05-07T19:42:58.3743896Z siblings : 48 2025-05-07T19:42:58.3744002Z core id : 7 2025-05-07T19:42:58.3744082Z cpu cores : 24 2025-05-07T19:42:58.3744167Z apicid : 15 2025-05-07T19:42:58.3744256Z initial apicid : 15 2025-05-07T19:42:58.3744514Z fpu : yes 2025-05-07T19:42:58.3744599Z fpu_exception : yes 2025-05-07T19:42:58.3744791Z cpuid level : 13 2025-05-07T19:42:58.3744869Z wp : yes 2025-05-07T19:42:58.3746924Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3747293Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3747405Z bogomips : 5999.98 2025-05-07T19:42:58.3747500Z clflush size : 64 2025-05-07T19:42:58.3747579Z cache_alignment : 64 2025-05-07T19:42:58.3747703Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3747811Z power management: 2025-05-07T19:42:58.3747815Z 2025-05-07T19:42:58.3747902Z processor : 56 2025-05-07T19:42:58.3747989Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3748100Z cpu family : 6 2025-05-07T19:42:58.3748180Z model : 85 2025-05-07T19:42:58.3748336Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3748439Z stepping : 7 2025-05-07T19:42:58.3748526Z microcode : 0x5003901 2025-05-07T19:42:58.3748611Z cpu MHz : 3175.512 2025-05-07T19:42:58.3748689Z cache size : 36608 KB 2025-05-07T19:42:58.3748796Z physical id : 0 2025-05-07T19:42:58.3748874Z siblings : 48 2025-05-07T19:42:58.3748950Z core id : 8 2025-05-07T19:42:58.3749035Z cpu cores : 24 2025-05-07T19:42:58.3749138Z apicid : 17 2025-05-07T19:42:58.3749279Z initial apicid : 17 2025-05-07T19:42:58.3749359Z fpu : yes 2025-05-07T19:42:58.3749468Z fpu_exception : yes 2025-05-07T19:42:58.3749553Z cpuid level : 13 2025-05-07T19:42:58.3749627Z wp : yes 2025-05-07T19:42:58.3751687Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3752064Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3752144Z bogomips : 5999.98 2025-05-07T19:42:58.3752253Z clflush size : 64 2025-05-07T19:42:58.3752343Z cache_alignment : 64 2025-05-07T19:42:58.3752543Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3752628Z power management: 2025-05-07T19:42:58.3752656Z 2025-05-07T19:42:58.3752747Z processor : 57 2025-05-07T19:42:58.3752843Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3753091Z cpu family : 6 2025-05-07T19:42:58.3753198Z model : 85 2025-05-07T19:42:58.3753366Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3753444Z stepping : 7 2025-05-07T19:42:58.3753540Z microcode : 0x5003901 2025-05-07T19:42:58.3753642Z cpu MHz : 3205.772 2025-05-07T19:42:58.3753735Z cache size : 36608 KB 2025-05-07T19:42:58.3753889Z physical id : 0 2025-05-07T19:42:58.3754009Z siblings : 48 2025-05-07T19:42:58.3754089Z core id : 9 2025-05-07T19:42:58.3754177Z cpu cores : 24 2025-05-07T19:42:58.3754268Z apicid : 19 2025-05-07T19:42:58.3754376Z initial apicid : 19 2025-05-07T19:42:58.3754459Z fpu : yes 2025-05-07T19:42:58.3754551Z fpu_exception : yes 2025-05-07T19:42:58.3754662Z cpuid level : 13 2025-05-07T19:42:58.3754803Z wp : yes 2025-05-07T19:42:58.3757024Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3757450Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3757539Z bogomips : 5999.98 2025-05-07T19:42:58.3757627Z clflush size : 64 2025-05-07T19:42:58.3757750Z cache_alignment : 64 2025-05-07T19:42:58.3757889Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3757981Z power management: 2025-05-07T19:42:58.3757985Z 2025-05-07T19:42:58.3758074Z processor : 58 2025-05-07T19:42:58.3758203Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3758294Z cpu family : 6 2025-05-07T19:42:58.3758378Z model : 85 2025-05-07T19:42:58.3758567Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3758656Z stepping : 7 2025-05-07T19:42:58.3758741Z microcode : 0x5003901 2025-05-07T19:42:58.3758825Z cpu MHz : 2999.994 2025-05-07T19:42:58.3758943Z cache size : 36608 KB 2025-05-07T19:42:58.3759036Z physical id : 0 2025-05-07T19:42:58.3759118Z siblings : 48 2025-05-07T19:42:58.3759226Z core id : 10 2025-05-07T19:42:58.3759310Z cpu cores : 24 2025-05-07T19:42:58.3759394Z apicid : 21 2025-05-07T19:42:58.3759491Z initial apicid : 21 2025-05-07T19:42:58.3759591Z fpu : yes 2025-05-07T19:42:58.3759678Z fpu_exception : yes 2025-05-07T19:42:58.3759817Z cpuid level : 13 2025-05-07T19:42:58.3759919Z wp : yes 2025-05-07T19:42:58.3762136Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3762531Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3762640Z bogomips : 5999.98 2025-05-07T19:42:58.3762731Z clflush size : 64 2025-05-07T19:42:58.3762818Z cache_alignment : 64 2025-05-07T19:42:58.3762982Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3763084Z power management: 2025-05-07T19:42:58.3763089Z 2025-05-07T19:42:58.3763173Z processor : 59 2025-05-07T19:42:58.3763272Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3763392Z cpu family : 6 2025-05-07T19:42:58.3763476Z model : 85 2025-05-07T19:42:58.3763642Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3763744Z stepping : 7 2025-05-07T19:42:58.3763841Z microcode : 0x5003901 2025-05-07T19:42:58.3763929Z cpu MHz : 3223.978 2025-05-07T19:42:58.3764007Z cache size : 36608 KB 2025-05-07T19:42:58.3764112Z physical id : 0 2025-05-07T19:42:58.3764201Z siblings : 48 2025-05-07T19:42:58.3764287Z core id : 11 2025-05-07T19:42:58.3764392Z cpu cores : 24 2025-05-07T19:42:58.3764482Z apicid : 23 2025-05-07T19:42:58.3764577Z initial apicid : 23 2025-05-07T19:42:58.3764661Z fpu : yes 2025-05-07T19:42:58.3764769Z fpu_exception : yes 2025-05-07T19:42:58.3764854Z cpuid level : 13 2025-05-07T19:42:58.3764949Z wp : yes 2025-05-07T19:42:58.3767117Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3767544Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3767636Z bogomips : 5999.98 2025-05-07T19:42:58.3767737Z clflush size : 64 2025-05-07T19:42:58.3767827Z cache_alignment : 64 2025-05-07T19:42:58.3767963Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3768082Z power management: 2025-05-07T19:42:58.3768089Z 2025-05-07T19:42:58.3768173Z processor : 60 2025-05-07T19:42:58.3768271Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3768359Z cpu family : 6 2025-05-07T19:42:58.3768460Z model : 85 2025-05-07T19:42:58.3768611Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3768700Z stepping : 7 2025-05-07T19:42:58.3768819Z microcode : 0x5003901 2025-05-07T19:42:58.3768902Z cpu MHz : 3244.637 2025-05-07T19:42:58.3768982Z cache size : 36608 KB 2025-05-07T19:42:58.3769070Z physical id : 0 2025-05-07T19:42:58.3769164Z siblings : 48 2025-05-07T19:42:58.3769243Z core id : 12 2025-05-07T19:42:58.3769335Z cpu cores : 24 2025-05-07T19:42:58.3769431Z apicid : 25 2025-05-07T19:42:58.3769517Z initial apicid : 25 2025-05-07T19:42:58.3769595Z fpu : yes 2025-05-07T19:42:58.3769684Z fpu_exception : yes 2025-05-07T19:42:58.3769779Z cpuid level : 13 2025-05-07T19:42:58.3769860Z wp : yes 2025-05-07T19:42:58.3772428Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3772834Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3772922Z bogomips : 5999.98 2025-05-07T19:42:58.3773005Z clflush size : 64 2025-05-07T19:42:58.3773126Z cache_alignment : 64 2025-05-07T19:42:58.3773257Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3773362Z power management: 2025-05-07T19:42:58.3773367Z 2025-05-07T19:42:58.3773487Z processor : 61 2025-05-07T19:42:58.3773591Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3773685Z cpu family : 6 2025-05-07T19:42:58.3773776Z model : 85 2025-05-07T19:42:58.3773972Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3774071Z stepping : 7 2025-05-07T19:42:58.3774165Z microcode : 0x5003901 2025-05-07T19:42:58.3774292Z cpu MHz : 2999.994 2025-05-07T19:42:58.3774391Z cache size : 36608 KB 2025-05-07T19:42:58.3774485Z physical id : 0 2025-05-07T19:42:58.3774575Z siblings : 48 2025-05-07T19:42:58.3774703Z core id : 13 2025-05-07T19:42:58.3774794Z cpu cores : 24 2025-05-07T19:42:58.3774887Z apicid : 27 2025-05-07T19:42:58.3774988Z initial apicid : 27 2025-05-07T19:42:58.3775114Z fpu : yes 2025-05-07T19:42:58.3775213Z fpu_exception : yes 2025-05-07T19:42:58.3775309Z cpuid level : 13 2025-05-07T19:42:58.3775426Z wp : yes 2025-05-07T19:42:58.3777476Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3777908Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3778041Z bogomips : 5999.98 2025-05-07T19:42:58.3778139Z clflush size : 64 2025-05-07T19:42:58.3778241Z cache_alignment : 64 2025-05-07T19:42:58.3778417Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3778513Z power management: 2025-05-07T19:42:58.3778517Z 2025-05-07T19:42:58.3778618Z processor : 62 2025-05-07T19:42:58.3778759Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3778858Z cpu family : 6 2025-05-07T19:42:58.3778949Z model : 85 2025-05-07T19:42:58.3779120Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3779241Z stepping : 7 2025-05-07T19:42:58.3779340Z microcode : 0x5003901 2025-05-07T19:42:58.3779437Z cpu MHz : 3233.635 2025-05-07T19:42:58.3779562Z cache size : 36608 KB 2025-05-07T19:42:58.3779662Z physical id : 0 2025-05-07T19:42:58.3779756Z siblings : 48 2025-05-07T19:42:58.3779852Z core id : 14 2025-05-07T19:42:58.3779982Z cpu cores : 24 2025-05-07T19:42:58.3780082Z apicid : 29 2025-05-07T19:42:58.3780181Z initial apicid : 29 2025-05-07T19:42:58.3780277Z fpu : yes 2025-05-07T19:42:58.3780413Z fpu_exception : yes 2025-05-07T19:42:58.3780510Z cpuid level : 13 2025-05-07T19:42:58.3780602Z wp : yes 2025-05-07T19:42:58.3782758Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3783150Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3783252Z bogomips : 5999.98 2025-05-07T19:42:58.3783378Z clflush size : 64 2025-05-07T19:42:58.3783479Z cache_alignment : 64 2025-05-07T19:42:58.3783615Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3783736Z power management: 2025-05-07T19:42:58.3783740Z 2025-05-07T19:42:58.3783832Z processor : 63 2025-05-07T19:42:58.3783933Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3784057Z cpu family : 6 2025-05-07T19:42:58.3784150Z model : 85 2025-05-07T19:42:58.3784315Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3784409Z stepping : 7 2025-05-07T19:42:58.3784532Z microcode : 0x5003901 2025-05-07T19:42:58.3784626Z cpu MHz : 3080.412 2025-05-07T19:42:58.3784719Z cache size : 36608 KB 2025-05-07T19:42:58.3784810Z physical id : 0 2025-05-07T19:42:58.3784923Z siblings : 48 2025-05-07T19:42:58.3785010Z core id : 15 2025-05-07T19:42:58.3785101Z cpu cores : 24 2025-05-07T19:42:58.3785214Z apicid : 31 2025-05-07T19:42:58.3785309Z initial apicid : 31 2025-05-07T19:42:58.3785399Z fpu : yes 2025-05-07T19:42:58.3785495Z fpu_exception : yes 2025-05-07T19:42:58.3785617Z cpuid level : 13 2025-05-07T19:42:58.3785707Z wp : yes 2025-05-07T19:42:58.3787762Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3788221Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3788314Z bogomips : 5999.98 2025-05-07T19:42:58.3788406Z clflush size : 64 2025-05-07T19:42:58.3788532Z cache_alignment : 64 2025-05-07T19:42:58.3788672Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3788775Z power management: 2025-05-07T19:42:58.3788779Z 2025-05-07T19:42:58.3788904Z processor : 64 2025-05-07T19:42:58.3789004Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3789102Z cpu family : 6 2025-05-07T19:42:58.3789190Z model : 85 2025-05-07T19:42:58.3789386Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3789478Z stepping : 7 2025-05-07T19:42:58.3789573Z microcode : 0x5003901 2025-05-07T19:42:58.3789693Z cpu MHz : 2999.994 2025-05-07T19:42:58.3789787Z cache size : 36608 KB 2025-05-07T19:42:58.3789881Z physical id : 0 2025-05-07T19:42:58.3789972Z siblings : 48 2025-05-07T19:42:58.3790092Z core id : 16 2025-05-07T19:42:58.3790182Z cpu cores : 24 2025-05-07T19:42:58.3790274Z apicid : 33 2025-05-07T19:42:58.3790393Z initial apicid : 33 2025-05-07T19:42:58.3790480Z fpu : yes 2025-05-07T19:42:58.3790577Z fpu_exception : yes 2025-05-07T19:42:58.3790668Z cpuid level : 13 2025-05-07T19:42:58.3790780Z wp : yes 2025-05-07T19:42:58.3793158Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3793610Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3793715Z bogomips : 5999.98 2025-05-07T19:42:58.3793905Z clflush size : 64 2025-05-07T19:42:58.3794011Z cache_alignment : 64 2025-05-07T19:42:58.3794202Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3794307Z power management: 2025-05-07T19:42:58.3794312Z 2025-05-07T19:42:58.3794412Z processor : 65 2025-05-07T19:42:58.3794553Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3794657Z cpu family : 6 2025-05-07T19:42:58.3794764Z model : 85 2025-05-07T19:42:58.3794950Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3795083Z stepping : 7 2025-05-07T19:42:58.3795182Z microcode : 0x5003901 2025-05-07T19:42:58.3795276Z cpu MHz : 2999.994 2025-05-07T19:42:58.3795399Z cache size : 36608 KB 2025-05-07T19:42:58.3795495Z physical id : 0 2025-05-07T19:42:58.3795588Z siblings : 48 2025-05-07T19:42:58.3795680Z core id : 17 2025-05-07T19:42:58.3795799Z cpu cores : 24 2025-05-07T19:42:58.3795893Z apicid : 35 2025-05-07T19:42:58.3795991Z initial apicid : 35 2025-05-07T19:42:58.3796115Z fpu : yes 2025-05-07T19:42:58.3796216Z fpu_exception : yes 2025-05-07T19:42:58.3796316Z cpuid level : 13 2025-05-07T19:42:58.3796412Z wp : yes 2025-05-07T19:42:58.3798662Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3799117Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3799246Z bogomips : 5999.98 2025-05-07T19:42:58.3799344Z clflush size : 64 2025-05-07T19:42:58.3799444Z cache_alignment : 64 2025-05-07T19:42:58.3799589Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3799715Z power management: 2025-05-07T19:42:58.3799719Z 2025-05-07T19:42:58.3799816Z processor : 66 2025-05-07T19:42:58.3799921Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3800046Z cpu family : 6 2025-05-07T19:42:58.3800139Z model : 85 2025-05-07T19:42:58.3800317Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3800418Z stepping : 7 2025-05-07T19:42:58.3800543Z microcode : 0x5003901 2025-05-07T19:42:58.3800639Z cpu MHz : 3238.926 2025-05-07T19:42:58.3800738Z cache size : 36608 KB 2025-05-07T19:42:58.3800863Z physical id : 0 2025-05-07T19:42:58.3800957Z siblings : 48 2025-05-07T19:42:58.3801051Z core id : 18 2025-05-07T19:42:58.3801146Z cpu cores : 24 2025-05-07T19:42:58.3801267Z apicid : 37 2025-05-07T19:42:58.3801367Z initial apicid : 37 2025-05-07T19:42:58.3801463Z fpu : yes 2025-05-07T19:42:58.3801595Z fpu_exception : yes 2025-05-07T19:42:58.3801692Z cpuid level : 13 2025-05-07T19:42:58.3801783Z wp : yes 2025-05-07T19:42:58.3804050Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3804491Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3804590Z bogomips : 5999.98 2025-05-07T19:42:58.3804717Z clflush size : 64 2025-05-07T19:42:58.3804816Z cache_alignment : 64 2025-05-07T19:42:58.3804964Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3805174Z power management: 2025-05-07T19:42:58.3805179Z 2025-05-07T19:42:58.3805295Z processor : 67 2025-05-07T19:42:58.3805392Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3805480Z cpu family : 6 2025-05-07T19:42:58.3805593Z model : 85 2025-05-07T19:42:58.3805763Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3805852Z stepping : 7 2025-05-07T19:42:58.3805946Z microcode : 0x5003901 2025-05-07T19:42:58.3806064Z cpu MHz : 3236.449 2025-05-07T19:42:58.3806155Z cache size : 36608 KB 2025-05-07T19:42:58.3806247Z physical id : 0 2025-05-07T19:42:58.3806364Z siblings : 48 2025-05-07T19:42:58.3806452Z core id : 19 2025-05-07T19:42:58.3806545Z cpu cores : 24 2025-05-07T19:42:58.3806635Z apicid : 39 2025-05-07T19:42:58.3806756Z initial apicid : 39 2025-05-07T19:42:58.3806845Z fpu : yes 2025-05-07T19:42:58.3806943Z fpu_exception : yes 2025-05-07T19:42:58.3807039Z cpuid level : 13 2025-05-07T19:42:58.3807161Z wp : yes 2025-05-07T19:42:58.3809212Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3809673Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3809767Z bogomips : 5999.98 2025-05-07T19:42:58.3809860Z clflush size : 64 2025-05-07T19:42:58.3809985Z cache_alignment : 64 2025-05-07T19:42:58.3810126Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3810221Z power management: 2025-05-07T19:42:58.3810226Z 2025-05-07T19:42:58.3810317Z processor : 68 2025-05-07T19:42:58.3810439Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3810532Z cpu family : 6 2025-05-07T19:42:58.3810619Z model : 85 2025-05-07T19:42:58.3810809Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3810899Z stepping : 7 2025-05-07T19:42:58.3810997Z microcode : 0x5003901 2025-05-07T19:42:58.3811087Z cpu MHz : 3226.229 2025-05-07T19:42:58.3811212Z cache size : 36608 KB 2025-05-07T19:42:58.3811307Z physical id : 0 2025-05-07T19:42:58.3811398Z siblings : 48 2025-05-07T19:42:58.3811519Z core id : 20 2025-05-07T19:42:58.3811610Z cpu cores : 24 2025-05-07T19:42:58.3811698Z apicid : 41 2025-05-07T19:42:58.3811792Z initial apicid : 41 2025-05-07T19:42:58.3811912Z fpu : yes 2025-05-07T19:42:58.3812007Z fpu_exception : yes 2025-05-07T19:42:58.3812098Z cpuid level : 13 2025-05-07T19:42:58.3812187Z wp : yes 2025-05-07T19:42:58.3814322Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3814705Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3814832Z bogomips : 5999.98 2025-05-07T19:42:58.3814931Z clflush size : 64 2025-05-07T19:42:58.3815032Z cache_alignment : 64 2025-05-07T19:42:58.3815172Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3815296Z power management: 2025-05-07T19:42:58.3815300Z 2025-05-07T19:42:58.3815403Z processor : 69 2025-05-07T19:42:58.3815511Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3815638Z cpu family : 6 2025-05-07T19:42:58.3815733Z model : 85 2025-05-07T19:42:58.3815907Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3816035Z stepping : 7 2025-05-07T19:42:58.3816133Z microcode : 0x5003901 2025-05-07T19:42:58.3816229Z cpu MHz : 3312.715 2025-05-07T19:42:58.3816327Z cache size : 36608 KB 2025-05-07T19:42:58.3816456Z physical id : 0 2025-05-07T19:42:58.3816546Z siblings : 48 2025-05-07T19:42:58.3816640Z core id : 21 2025-05-07T19:42:58.3816736Z cpu cores : 24 2025-05-07T19:42:58.3816861Z apicid : 43 2025-05-07T19:42:58.3816958Z initial apicid : 43 2025-05-07T19:42:58.3817047Z fpu : yes 2025-05-07T19:42:58.3817180Z fpu_exception : yes 2025-05-07T19:42:58.3817283Z cpuid level : 13 2025-05-07T19:42:58.3817379Z wp : yes 2025-05-07T19:42:58.3819469Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3819903Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3819996Z bogomips : 5999.98 2025-05-07T19:42:58.3820124Z clflush size : 64 2025-05-07T19:42:58.3820227Z cache_alignment : 64 2025-05-07T19:42:58.3820368Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3820467Z power management: 2025-05-07T19:42:58.3820499Z 2025-05-07T19:42:58.3820590Z processor : 70 2025-05-07T19:42:58.3820694Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3820790Z cpu family : 6 2025-05-07T19:42:58.3820915Z model : 85 2025-05-07T19:42:58.3821086Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3821185Z stepping : 7 2025-05-07T19:42:58.3821282Z microcode : 0x5003901 2025-05-07T19:42:58.3821414Z cpu MHz : 3227.483 2025-05-07T19:42:58.3821517Z cache size : 36608 KB 2025-05-07T19:42:58.3821615Z physical id : 0 2025-05-07T19:42:58.3821735Z siblings : 48 2025-05-07T19:42:58.3821822Z core id : 22 2025-05-07T19:42:58.3821910Z cpu cores : 24 2025-05-07T19:42:58.3821998Z apicid : 45 2025-05-07T19:42:58.3822115Z initial apicid : 45 2025-05-07T19:42:58.3822203Z fpu : yes 2025-05-07T19:42:58.3822302Z fpu_exception : yes 2025-05-07T19:42:58.3822424Z cpuid level : 13 2025-05-07T19:42:58.3822514Z wp : yes 2025-05-07T19:42:58.3824598Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3825004Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3825098Z bogomips : 5999.98 2025-05-07T19:42:58.3825189Z clflush size : 64 2025-05-07T19:42:58.3825314Z cache_alignment : 64 2025-05-07T19:42:58.3825449Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3825540Z power management: 2025-05-07T19:42:58.3825544Z 2025-05-07T19:42:58.3825640Z processor : 71 2025-05-07T19:42:58.3825764Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3825855Z cpu family : 6 2025-05-07T19:42:58.3825942Z model : 85 2025-05-07T19:42:58.3826138Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3826227Z stepping : 7 2025-05-07T19:42:58.3826324Z microcode : 0x5003901 2025-05-07T19:42:58.3826418Z cpu MHz : 3248.807 2025-05-07T19:42:58.3841934Z cache size : 36608 KB 2025-05-07T19:42:58.3842093Z physical id : 0 2025-05-07T19:42:58.3842200Z siblings : 48 2025-05-07T19:42:58.3842294Z core id : 23 2025-05-07T19:42:58.3842385Z cpu cores : 24 2025-05-07T19:42:58.3842467Z apicid : 47 2025-05-07T19:42:58.3842577Z initial apicid : 47 2025-05-07T19:42:58.3842663Z fpu : yes 2025-05-07T19:42:58.3842754Z fpu_exception : yes 2025-05-07T19:42:58.3842855Z cpuid level : 13 2025-05-07T19:42:58.3842936Z wp : yes 2025-05-07T19:42:58.3845239Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3845632Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3845888Z bogomips : 5999.98 2025-05-07T19:42:58.3845966Z clflush size : 64 2025-05-07T19:42:58.3846068Z cache_alignment : 64 2025-05-07T19:42:58.3846191Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3846277Z power management: 2025-05-07T19:42:58.3846283Z 2025-05-07T19:42:58.3846363Z processor : 72 2025-05-07T19:42:58.3846469Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3846551Z cpu family : 6 2025-05-07T19:42:58.3846632Z model : 85 2025-05-07T19:42:58.3846816Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3846893Z stepping : 7 2025-05-07T19:42:58.3846977Z microcode : 0x5003901 2025-05-07T19:42:58.3847060Z cpu MHz : 1523.831 2025-05-07T19:42:58.3847152Z cache size : 36608 KB 2025-05-07T19:42:58.3847237Z physical id : 1 2025-05-07T19:42:58.3847325Z siblings : 48 2025-05-07T19:42:58.3847423Z core id : 0 2025-05-07T19:42:58.3847503Z cpu cores : 24 2025-05-07T19:42:58.3847580Z apicid : 65 2025-05-07T19:42:58.3847661Z initial apicid : 65 2025-05-07T19:42:58.3847746Z fpu : yes 2025-05-07T19:42:58.3847830Z fpu_exception : yes 2025-05-07T19:42:58.3847906Z cpuid level : 13 2025-05-07T19:42:58.3847996Z wp : yes 2025-05-07T19:42:58.3850050Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3850488Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3850590Z bogomips : 5999.98 2025-05-07T19:42:58.3850670Z clflush size : 64 2025-05-07T19:42:58.3850751Z cache_alignment : 64 2025-05-07T19:42:58.3850895Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3850983Z power management: 2025-05-07T19:42:58.3850987Z 2025-05-07T19:42:58.3851065Z processor : 73 2025-05-07T19:42:58.3851157Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3851256Z cpu family : 6 2025-05-07T19:42:58.3851330Z model : 85 2025-05-07T19:42:58.3851490Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3851602Z stepping : 7 2025-05-07T19:42:58.3851684Z microcode : 0x5003901 2025-05-07T19:42:58.3851766Z cpu MHz : 2999.994 2025-05-07T19:42:58.3851846Z cache size : 36608 KB 2025-05-07T19:42:58.3851958Z physical id : 1 2025-05-07T19:42:58.3852037Z siblings : 48 2025-05-07T19:42:58.3852121Z core id : 1 2025-05-07T19:42:58.3852216Z cpu cores : 24 2025-05-07T19:42:58.3852301Z apicid : 67 2025-05-07T19:42:58.3852387Z initial apicid : 67 2025-05-07T19:42:58.3852465Z fpu : yes 2025-05-07T19:42:58.3852572Z fpu_exception : yes 2025-05-07T19:42:58.3852653Z cpuid level : 13 2025-05-07T19:42:58.3852727Z wp : yes 2025-05-07T19:42:58.3854777Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3855155Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3855286Z bogomips : 5999.98 2025-05-07T19:42:58.3855380Z clflush size : 64 2025-05-07T19:42:58.3855467Z cache_alignment : 64 2025-05-07T19:42:58.3855590Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3855687Z power management: 2025-05-07T19:42:58.3855691Z 2025-05-07T19:42:58.3855774Z processor : 74 2025-05-07T19:42:58.3855859Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3855939Z cpu family : 6 2025-05-07T19:42:58.3856033Z model : 85 2025-05-07T19:42:58.3856190Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3856270Z stepping : 7 2025-05-07T19:42:58.3856363Z microcode : 0x5003901 2025-05-07T19:42:58.3856445Z cpu MHz : 2999.994 2025-05-07T19:42:58.3856521Z cache size : 36608 KB 2025-05-07T19:42:58.3856605Z physical id : 1 2025-05-07T19:42:58.3856706Z siblings : 48 2025-05-07T19:42:58.3856784Z core id : 2 2025-05-07T19:42:58.3856869Z cpu cores : 24 2025-05-07T19:42:58.3856964Z apicid : 69 2025-05-07T19:42:58.3857043Z initial apicid : 69 2025-05-07T19:42:58.3857123Z fpu : yes 2025-05-07T19:42:58.3857208Z fpu_exception : yes 2025-05-07T19:42:58.3857293Z cpuid level : 13 2025-05-07T19:42:58.3857369Z wp : yes 2025-05-07T19:42:58.3859414Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3859837Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3859914Z bogomips : 5999.98 2025-05-07T19:42:58.3859999Z clflush size : 64 2025-05-07T19:42:58.3860086Z cache_alignment : 64 2025-05-07T19:42:58.3860210Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3860294Z power management: 2025-05-07T19:42:58.3860298Z 2025-05-07T19:42:58.3860391Z processor : 75 2025-05-07T19:42:58.3860475Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3860551Z cpu family : 6 2025-05-07T19:42:58.3860631Z model : 85 2025-05-07T19:42:58.3860800Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3860882Z stepping : 7 2025-05-07T19:42:58.3860968Z microcode : 0x5003901 2025-05-07T19:42:58.3861070Z cpu MHz : 1201.056 2025-05-07T19:42:58.3861153Z cache size : 36608 KB 2025-05-07T19:42:58.3861237Z physical id : 1 2025-05-07T19:42:58.3861317Z siblings : 48 2025-05-07T19:42:58.3861413Z core id : 3 2025-05-07T19:42:58.3861495Z cpu cores : 24 2025-05-07T19:42:58.3861575Z apicid : 71 2025-05-07T19:42:58.3861660Z initial apicid : 71 2025-05-07T19:42:58.3861759Z fpu : yes 2025-05-07T19:42:58.3861850Z fpu_exception : yes 2025-05-07T19:42:58.3861932Z cpuid level : 13 2025-05-07T19:42:58.3862037Z wp : yes 2025-05-07T19:42:58.3864069Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3864437Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3864542Z bogomips : 5999.98 2025-05-07T19:42:58.3864621Z clflush size : 64 2025-05-07T19:42:58.3864753Z cache_alignment : 64 2025-05-07T19:42:58.3864897Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3864977Z power management: 2025-05-07T19:42:58.3864981Z 2025-05-07T19:42:58.3865059Z processor : 76 2025-05-07T19:42:58.3865161Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3865242Z cpu family : 6 2025-05-07T19:42:58.3865319Z model : 85 2025-05-07T19:42:58.3865467Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3865567Z stepping : 7 2025-05-07T19:42:58.3865656Z microcode : 0x5003901 2025-05-07T19:42:58.3865736Z cpu MHz : 2999.994 2025-05-07T19:42:58.3865813Z cache size : 36608 KB 2025-05-07T19:42:58.3865913Z physical id : 1 2025-05-07T19:42:58.3865990Z siblings : 48 2025-05-07T19:42:58.3866064Z core id : 4 2025-05-07T19:42:58.3866157Z cpu cores : 24 2025-05-07T19:42:58.3866231Z apicid : 73 2025-05-07T19:42:58.3866317Z initial apicid : 73 2025-05-07T19:42:58.3866395Z fpu : yes 2025-05-07T19:42:58.3866498Z fpu_exception : yes 2025-05-07T19:42:58.3866576Z cpuid level : 13 2025-05-07T19:42:58.3866652Z wp : yes 2025-05-07T19:42:58.3868703Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3869071Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3869154Z bogomips : 5999.98 2025-05-07T19:42:58.3869293Z clflush size : 64 2025-05-07T19:42:58.3869381Z cache_alignment : 64 2025-05-07T19:42:58.3869510Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3869608Z power management: 2025-05-07T19:42:58.3869613Z 2025-05-07T19:42:58.3869688Z processor : 77 2025-05-07T19:42:58.3869771Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3869850Z cpu family : 6 2025-05-07T19:42:58.3869946Z model : 85 2025-05-07T19:42:58.3870112Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3870194Z stepping : 7 2025-05-07T19:42:58.3870291Z microcode : 0x5003901 2025-05-07T19:42:58.3870363Z cpu MHz : 2999.994 2025-05-07T19:42:58.3870451Z cache size : 36608 KB 2025-05-07T19:42:58.3870532Z physical id : 1 2025-05-07T19:42:58.3870630Z siblings : 48 2025-05-07T19:42:58.3870713Z core id : 5 2025-05-07T19:42:58.3870800Z cpu cores : 24 2025-05-07T19:42:58.3870888Z apicid : 75 2025-05-07T19:42:58.3870970Z initial apicid : 75 2025-05-07T19:42:58.3871053Z fpu : yes 2025-05-07T19:42:58.3871143Z fpu_exception : yes 2025-05-07T19:42:58.3871237Z cpuid level : 13 2025-05-07T19:42:58.3871323Z wp : yes 2025-05-07T19:42:58.3873664Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3874082Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3874171Z bogomips : 5999.98 2025-05-07T19:42:58.3874258Z clflush size : 64 2025-05-07T19:42:58.3874377Z cache_alignment : 64 2025-05-07T19:42:58.3874516Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3874660Z power management: 2025-05-07T19:42:58.3874664Z 2025-05-07T19:42:58.3874775Z processor : 78 2025-05-07T19:42:58.3874868Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3874951Z cpu family : 6 2025-05-07T19:42:58.3875035Z model : 85 2025-05-07T19:42:58.3875220Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3875307Z stepping : 7 2025-05-07T19:42:58.3875399Z microcode : 0x5003901 2025-05-07T19:42:58.3875508Z cpu MHz : 2999.994 2025-05-07T19:42:58.3875593Z cache size : 36608 KB 2025-05-07T19:42:58.3875681Z physical id : 1 2025-05-07T19:42:58.3875765Z siblings : 48 2025-05-07T19:42:58.3875863Z core id : 6 2025-05-07T19:42:58.3875945Z cpu cores : 24 2025-05-07T19:42:58.3876024Z apicid : 77 2025-05-07T19:42:58.3876131Z initial apicid : 77 2025-05-07T19:42:58.3876208Z fpu : yes 2025-05-07T19:42:58.3876297Z fpu_exception : yes 2025-05-07T19:42:58.3876383Z cpuid level : 13 2025-05-07T19:42:58.3876471Z wp : yes 2025-05-07T19:42:58.3878662Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3879070Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3879153Z bogomips : 5999.98 2025-05-07T19:42:58.3879233Z clflush size : 64 2025-05-07T19:42:58.3879321Z cache_alignment : 64 2025-05-07T19:42:58.3879512Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3879597Z power management: 2025-05-07T19:42:58.3879606Z 2025-05-07T19:42:58.3879685Z processor : 79 2025-05-07T19:42:58.3879813Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3879895Z cpu family : 6 2025-05-07T19:42:58.3879976Z model : 85 2025-05-07T19:42:58.3880141Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3880240Z stepping : 7 2025-05-07T19:42:58.3880326Z microcode : 0x5003901 2025-05-07T19:42:58.3880410Z cpu MHz : 2999.994 2025-05-07T19:42:58.3880509Z cache size : 36608 KB 2025-05-07T19:42:58.3880592Z physical id : 1 2025-05-07T19:42:58.3880673Z siblings : 48 2025-05-07T19:42:58.3880748Z core id : 7 2025-05-07T19:42:58.3880846Z cpu cores : 24 2025-05-07T19:42:58.3880923Z apicid : 79 2025-05-07T19:42:58.3881009Z initial apicid : 79 2025-05-07T19:42:58.3881099Z fpu : yes 2025-05-07T19:42:58.3881188Z fpu_exception : yes 2025-05-07T19:42:58.3881268Z cpuid level : 13 2025-05-07T19:42:58.3881349Z wp : yes 2025-05-07T19:42:58.3883579Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3883979Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3884078Z bogomips : 5999.98 2025-05-07T19:42:58.3884167Z clflush size : 64 2025-05-07T19:42:58.3884254Z cache_alignment : 64 2025-05-07T19:42:58.3884393Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3884498Z power management: 2025-05-07T19:42:58.3884503Z 2025-05-07T19:42:58.3884656Z processor : 80 2025-05-07T19:42:58.3884755Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3884850Z cpu family : 6 2025-05-07T19:42:58.3884932Z model : 85 2025-05-07T19:42:58.3885202Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3885279Z stepping : 7 2025-05-07T19:42:58.3885379Z microcode : 0x5003901 2025-05-07T19:42:58.3885460Z cpu MHz : 2999.994 2025-05-07T19:42:58.3885545Z cache size : 36608 KB 2025-05-07T19:42:58.3885643Z physical id : 1 2025-05-07T19:42:58.3885721Z siblings : 48 2025-05-07T19:42:58.3885800Z core id : 8 2025-05-07T19:42:58.3885879Z cpu cores : 24 2025-05-07T19:42:58.3885966Z apicid : 81 2025-05-07T19:42:58.3886050Z initial apicid : 81 2025-05-07T19:42:58.3886124Z fpu : yes 2025-05-07T19:42:58.3886206Z fpu_exception : yes 2025-05-07T19:42:58.3886300Z cpuid level : 13 2025-05-07T19:42:58.3886377Z wp : yes 2025-05-07T19:42:58.3888407Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3888786Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3888869Z bogomips : 5999.98 2025-05-07T19:42:58.3888950Z clflush size : 64 2025-05-07T19:42:58.3889042Z cache_alignment : 64 2025-05-07T19:42:58.3889164Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3889246Z power management: 2025-05-07T19:42:58.3889296Z 2025-05-07T19:42:58.3889386Z processor : 81 2025-05-07T19:42:58.3889472Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3889549Z cpu family : 6 2025-05-07T19:42:58.3889637Z model : 85 2025-05-07T19:42:58.3889786Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3889861Z stepping : 7 2025-05-07T19:42:58.3889937Z microcode : 0x5003901 2025-05-07T19:42:58.3890026Z cpu MHz : 2999.994 2025-05-07T19:42:58.3890105Z cache size : 36608 KB 2025-05-07T19:42:58.3890187Z physical id : 1 2025-05-07T19:42:58.3890278Z siblings : 48 2025-05-07T19:42:58.3890354Z core id : 9 2025-05-07T19:42:58.3890431Z cpu cores : 24 2025-05-07T19:42:58.3890509Z apicid : 83 2025-05-07T19:42:58.3890607Z initial apicid : 83 2025-05-07T19:42:58.3890679Z fpu : yes 2025-05-07T19:42:58.3890765Z fpu_exception : yes 2025-05-07T19:42:58.3890841Z cpuid level : 13 2025-05-07T19:42:58.3890933Z wp : yes 2025-05-07T19:42:58.3892974Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3893359Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3893439Z bogomips : 5999.98 2025-05-07T19:42:58.3893516Z clflush size : 64 2025-05-07T19:42:58.3893601Z cache_alignment : 64 2025-05-07T19:42:58.3893731Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3893811Z power management: 2025-05-07T19:42:58.3893815Z 2025-05-07T19:42:58.3893893Z processor : 82 2025-05-07T19:42:58.3893992Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3894122Z cpu family : 6 2025-05-07T19:42:58.3894198Z model : 85 2025-05-07T19:42:58.3894344Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3894435Z stepping : 7 2025-05-07T19:42:58.3894511Z microcode : 0x5003901 2025-05-07T19:42:58.3894586Z cpu MHz : 1245.833 2025-05-07T19:42:58.3894679Z cache size : 36608 KB 2025-05-07T19:42:58.3894759Z physical id : 1 2025-05-07T19:42:58.3894835Z siblings : 48 2025-05-07T19:42:58.3894910Z core id : 10 2025-05-07T19:42:58.3894992Z cpu cores : 24 2025-05-07T19:42:58.3895066Z apicid : 85 2025-05-07T19:42:58.3895143Z initial apicid : 85 2025-05-07T19:42:58.3895235Z fpu : yes 2025-05-07T19:42:58.3895309Z fpu_exception : yes 2025-05-07T19:42:58.3895380Z cpuid level : 13 2025-05-07T19:42:58.3895449Z wp : yes 2025-05-07T19:42:58.3897510Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3897880Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3897983Z bogomips : 5999.98 2025-05-07T19:42:58.3898058Z clflush size : 64 2025-05-07T19:42:58.3898140Z cache_alignment : 64 2025-05-07T19:42:58.3898268Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3898360Z power management: 2025-05-07T19:42:58.3898364Z 2025-05-07T19:42:58.3898437Z processor : 83 2025-05-07T19:42:58.3898570Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3898659Z cpu family : 6 2025-05-07T19:42:58.3898748Z model : 85 2025-05-07T19:42:58.3898911Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3898998Z stepping : 7 2025-05-07T19:42:58.3899082Z microcode : 0x5003901 2025-05-07T19:42:58.3899159Z cpu MHz : 2999.994 2025-05-07T19:42:58.3899258Z cache size : 36608 KB 2025-05-07T19:42:58.3899338Z physical id : 1 2025-05-07T19:42:58.3899417Z siblings : 48 2025-05-07T19:42:58.3899501Z core id : 11 2025-05-07T19:42:58.3899591Z cpu cores : 24 2025-05-07T19:42:58.3899666Z apicid : 87 2025-05-07T19:42:58.3899754Z initial apicid : 87 2025-05-07T19:42:58.3899846Z fpu : yes 2025-05-07T19:42:58.3899928Z fpu_exception : yes 2025-05-07T19:42:58.3900011Z cpuid level : 13 2025-05-07T19:42:58.3900088Z wp : yes 2025-05-07T19:42:58.3902135Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3902504Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3902611Z bogomips : 5999.98 2025-05-07T19:42:58.3902692Z clflush size : 64 2025-05-07T19:42:58.3902778Z cache_alignment : 64 2025-05-07T19:42:58.3902905Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3902999Z power management: 2025-05-07T19:42:58.3903003Z 2025-05-07T19:42:58.3903076Z processor : 84 2025-05-07T19:42:58.3903165Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3903261Z cpu family : 6 2025-05-07T19:42:58.3903341Z model : 85 2025-05-07T19:42:58.3903496Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3903624Z stepping : 7 2025-05-07T19:42:58.3903711Z microcode : 0x5003901 2025-05-07T19:42:58.3903793Z cpu MHz : 2999.994 2025-05-07T19:42:58.3903877Z cache size : 36608 KB 2025-05-07T19:42:58.3903977Z physical id : 1 2025-05-07T19:42:58.3904052Z siblings : 48 2025-05-07T19:42:58.3904129Z core id : 12 2025-05-07T19:42:58.3904209Z cpu cores : 24 2025-05-07T19:42:58.3904295Z apicid : 89 2025-05-07T19:42:58.3904379Z initial apicid : 89 2025-05-07T19:42:58.3904458Z fpu : yes 2025-05-07T19:42:58.3904544Z fpu_exception : yes 2025-05-07T19:42:58.3904630Z cpuid level : 13 2025-05-07T19:42:58.3904705Z wp : yes 2025-05-07T19:42:58.3906756Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3907136Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3907212Z bogomips : 5999.98 2025-05-07T19:42:58.3907312Z clflush size : 64 2025-05-07T19:42:58.3907388Z cache_alignment : 64 2025-05-07T19:42:58.3907510Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3907593Z power management: 2025-05-07T19:42:58.3907597Z 2025-05-07T19:42:58.3907688Z processor : 85 2025-05-07T19:42:58.3907779Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3907860Z cpu family : 6 2025-05-07T19:42:58.3907948Z model : 85 2025-05-07T19:42:58.3908146Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3908233Z stepping : 7 2025-05-07T19:42:58.3908308Z microcode : 0x5003901 2025-05-07T19:42:58.3908393Z cpu MHz : 1199.590 2025-05-07T19:42:58.3908466Z cache size : 36608 KB 2025-05-07T19:42:58.3908538Z physical id : 1 2025-05-07T19:42:58.3908613Z siblings : 48 2025-05-07T19:42:58.3908686Z core id : 13 2025-05-07T19:42:58.3908761Z cpu cores : 24 2025-05-07T19:42:58.3908838Z apicid : 91 2025-05-07T19:42:58.3908934Z initial apicid : 91 2025-05-07T19:42:58.3909010Z fpu : yes 2025-05-07T19:42:58.3909097Z fpu_exception : yes 2025-05-07T19:42:58.3909170Z cpuid level : 13 2025-05-07T19:42:58.3909254Z wp : yes 2025-05-07T19:42:58.3911292Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3911673Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3911749Z bogomips : 5999.98 2025-05-07T19:42:58.3911835Z clflush size : 64 2025-05-07T19:42:58.3911922Z cache_alignment : 64 2025-05-07T19:42:58.3912058Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3912143Z power management: 2025-05-07T19:42:58.3912147Z 2025-05-07T19:42:58.3912224Z processor : 86 2025-05-07T19:42:58.3912325Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3912400Z cpu family : 6 2025-05-07T19:42:58.3912562Z model : 85 2025-05-07T19:42:58.3912743Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3912816Z stepping : 7 2025-05-07T19:42:58.3913463Z microcode : 0x5003901 2025-05-07T19:42:58.3913546Z cpu MHz : 2999.994 2025-05-07T19:42:58.3913643Z cache size : 36608 KB 2025-05-07T19:42:58.3913725Z physical id : 1 2025-05-07T19:42:58.3913855Z siblings : 48 2025-05-07T19:42:58.3913944Z core id : 14 2025-05-07T19:42:58.3914035Z cpu cores : 24 2025-05-07T19:42:58.3914113Z apicid : 93 2025-05-07T19:42:58.3914200Z initial apicid : 93 2025-05-07T19:42:58.3914296Z fpu : yes 2025-05-07T19:42:58.3914381Z fpu_exception : yes 2025-05-07T19:42:58.3914465Z cpuid level : 13 2025-05-07T19:42:58.3914547Z wp : yes 2025-05-07T19:42:58.3916756Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3917156Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3917262Z bogomips : 5999.98 2025-05-07T19:42:58.3917339Z clflush size : 64 2025-05-07T19:42:58.3917419Z cache_alignment : 64 2025-05-07T19:42:58.3917548Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3917647Z power management: 2025-05-07T19:42:58.3917652Z 2025-05-07T19:42:58.3917730Z processor : 87 2025-05-07T19:42:58.3917815Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3917908Z cpu family : 6 2025-05-07T19:42:58.3917980Z model : 85 2025-05-07T19:42:58.3918139Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3918283Z stepping : 7 2025-05-07T19:42:58.3918377Z microcode : 0x5003901 2025-05-07T19:42:58.3918460Z cpu MHz : 1254.462 2025-05-07T19:42:58.3918541Z cache size : 36608 KB 2025-05-07T19:42:58.3918638Z physical id : 1 2025-05-07T19:42:58.3918719Z siblings : 48 2025-05-07T19:42:58.3918803Z core id : 15 2025-05-07T19:42:58.3918880Z cpu cores : 24 2025-05-07T19:42:58.3918983Z apicid : 95 2025-05-07T19:42:58.3919075Z initial apicid : 95 2025-05-07T19:42:58.3919163Z fpu : yes 2025-05-07T19:42:58.3919275Z fpu_exception : yes 2025-05-07T19:42:58.3919361Z cpuid level : 13 2025-05-07T19:42:58.3919443Z wp : yes 2025-05-07T19:42:58.3921685Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3922084Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3922173Z bogomips : 5999.98 2025-05-07T19:42:58.3922282Z clflush size : 64 2025-05-07T19:42:58.3922365Z cache_alignment : 64 2025-05-07T19:42:58.3922500Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3922588Z power management: 2025-05-07T19:42:58.3922592Z 2025-05-07T19:42:58.3922685Z processor : 88 2025-05-07T19:42:58.3922778Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3922862Z cpu family : 6 2025-05-07T19:42:58.3922959Z model : 85 2025-05-07T19:42:58.3923124Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3923210Z stepping : 7 2025-05-07T19:42:58.3923304Z microcode : 0x5003901 2025-05-07T19:42:58.3923396Z cpu MHz : 1277.194 2025-05-07T19:42:58.3923535Z cache size : 36608 KB 2025-05-07T19:42:58.3923624Z physical id : 1 2025-05-07T19:42:58.3923727Z siblings : 48 2025-05-07T19:42:58.3923808Z core id : 16 2025-05-07T19:42:58.3923887Z cpu cores : 24 2025-05-07T19:42:58.3923959Z apicid : 97 2025-05-07T19:42:58.3924054Z initial apicid : 97 2025-05-07T19:42:58.3924141Z fpu : yes 2025-05-07T19:42:58.3924230Z fpu_exception : yes 2025-05-07T19:42:58.3924337Z cpuid level : 13 2025-05-07T19:42:58.3924420Z wp : yes 2025-05-07T19:42:58.3926637Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3927018Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3927099Z bogomips : 5999.98 2025-05-07T19:42:58.3927173Z clflush size : 64 2025-05-07T19:42:58.3927269Z cache_alignment : 64 2025-05-07T19:42:58.3927393Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3927474Z power management: 2025-05-07T19:42:58.3927478Z 2025-05-07T19:42:58.3927561Z processor : 89 2025-05-07T19:42:58.3927663Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3927742Z cpu family : 6 2025-05-07T19:42:58.3927811Z model : 85 2025-05-07T19:42:58.3927987Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3928067Z stepping : 7 2025-05-07T19:42:58.3928152Z microcode : 0x5003901 2025-05-07T19:42:58.3928271Z cpu MHz : 2999.994 2025-05-07T19:42:58.3928359Z cache size : 36608 KB 2025-05-07T19:42:58.3928444Z physical id : 1 2025-05-07T19:42:58.3928520Z siblings : 48 2025-05-07T19:42:58.3928617Z core id : 17 2025-05-07T19:42:58.3928690Z cpu cores : 24 2025-05-07T19:42:58.3928767Z apicid : 99 2025-05-07T19:42:58.3928857Z initial apicid : 99 2025-05-07T19:42:58.3928944Z fpu : yes 2025-05-07T19:42:58.3929025Z fpu_exception : yes 2025-05-07T19:42:58.3929100Z cpuid level : 13 2025-05-07T19:42:58.3929193Z wp : yes 2025-05-07T19:42:58.3931371Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3931918Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3932018Z bogomips : 5999.98 2025-05-07T19:42:58.3932105Z clflush size : 64 2025-05-07T19:42:58.3932194Z cache_alignment : 64 2025-05-07T19:42:58.3932459Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3932549Z power management: 2025-05-07T19:42:58.3932553Z 2025-05-07T19:42:58.3932632Z processor : 90 2025-05-07T19:42:58.3932723Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3932888Z cpu family : 6 2025-05-07T19:42:58.3932967Z model : 85 2025-05-07T19:42:58.3933129Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3933225Z stepping : 7 2025-05-07T19:42:58.3933309Z microcode : 0x5003901 2025-05-07T19:42:58.3933387Z cpu MHz : 2999.994 2025-05-07T19:42:58.3933477Z cache size : 36608 KB 2025-05-07T19:42:58.3933574Z physical id : 1 2025-05-07T19:42:58.3933740Z siblings : 48 2025-05-07T19:42:58.3933816Z core id : 18 2025-05-07T19:42:58.3933918Z cpu cores : 24 2025-05-07T19:42:58.3933999Z apicid : 101 2025-05-07T19:42:58.3934086Z initial apicid : 101 2025-05-07T19:42:58.3934168Z fpu : yes 2025-05-07T19:42:58.3934273Z fpu_exception : yes 2025-05-07T19:42:58.3934353Z cpuid level : 13 2025-05-07T19:42:58.3934431Z wp : yes 2025-05-07T19:42:58.3936661Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3937061Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3937143Z bogomips : 5999.98 2025-05-07T19:42:58.3937249Z clflush size : 64 2025-05-07T19:42:58.3937331Z cache_alignment : 64 2025-05-07T19:42:58.3937473Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3937575Z power management: 2025-05-07T19:42:58.3937580Z 2025-05-07T19:42:58.3937659Z processor : 91 2025-05-07T19:42:58.3937746Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3937837Z cpu family : 6 2025-05-07T19:42:58.3937930Z model : 85 2025-05-07T19:42:58.3938090Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3938170Z stepping : 7 2025-05-07T19:42:58.3938270Z microcode : 0x5003901 2025-05-07T19:42:58.3938351Z cpu MHz : 2999.994 2025-05-07T19:42:58.3938439Z cache size : 36608 KB 2025-05-07T19:42:58.3938597Z physical id : 1 2025-05-07T19:42:58.3938690Z siblings : 48 2025-05-07T19:42:58.3938775Z core id : 19 2025-05-07T19:42:58.3938859Z cpu cores : 24 2025-05-07T19:42:58.3938954Z apicid : 103 2025-05-07T19:42:58.3939042Z initial apicid : 103 2025-05-07T19:42:58.3939121Z fpu : yes 2025-05-07T19:42:58.3939206Z fpu_exception : yes 2025-05-07T19:42:58.3939296Z cpuid level : 13 2025-05-07T19:42:58.3939369Z wp : yes 2025-05-07T19:42:58.3941566Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3941972Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3942057Z bogomips : 5999.98 2025-05-07T19:42:58.3942143Z clflush size : 64 2025-05-07T19:42:58.3942239Z cache_alignment : 64 2025-05-07T19:42:58.3942369Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3942460Z power management: 2025-05-07T19:42:58.3942464Z 2025-05-07T19:42:58.3942553Z processor : 92 2025-05-07T19:42:58.3942644Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3942730Z cpu family : 6 2025-05-07T19:42:58.3942813Z model : 85 2025-05-07T19:42:58.3942985Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3943066Z stepping : 7 2025-05-07T19:42:58.3943151Z microcode : 0x5003901 2025-05-07T19:42:58.3943244Z cpu MHz : 2999.994 2025-05-07T19:42:58.3943326Z cache size : 36608 KB 2025-05-07T19:42:58.3943410Z physical id : 1 2025-05-07T19:42:58.3943502Z siblings : 48 2025-05-07T19:42:58.3943694Z core id : 20 2025-05-07T19:42:58.3943770Z cpu cores : 24 2025-05-07T19:42:58.3943894Z apicid : 105 2025-05-07T19:42:58.3943975Z initial apicid : 105 2025-05-07T19:42:58.3944056Z fpu : yes 2025-05-07T19:42:58.3944132Z fpu_exception : yes 2025-05-07T19:42:58.3944206Z cpuid level : 13 2025-05-07T19:42:58.3944286Z wp : yes 2025-05-07T19:42:58.3946309Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3946675Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3946766Z bogomips : 5999.98 2025-05-07T19:42:58.3946837Z clflush size : 64 2025-05-07T19:42:58.3946921Z cache_alignment : 64 2025-05-07T19:42:58.3947055Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3947130Z power management: 2025-05-07T19:42:58.3947134Z 2025-05-07T19:42:58.3947210Z processor : 93 2025-05-07T19:42:58.3947306Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3947380Z cpu family : 6 2025-05-07T19:42:58.3947453Z model : 85 2025-05-07T19:42:58.3947603Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3947683Z stepping : 7 2025-05-07T19:42:58.3947759Z microcode : 0x5003901 2025-05-07T19:42:58.3947830Z cpu MHz : 2999.994 2025-05-07T19:42:58.3947914Z cache size : 36608 KB 2025-05-07T19:42:58.3947984Z physical id : 1 2025-05-07T19:42:58.3948061Z siblings : 48 2025-05-07T19:42:58.3948130Z core id : 21 2025-05-07T19:42:58.3948283Z cpu cores : 24 2025-05-07T19:42:58.3948371Z apicid : 107 2025-05-07T19:42:58.3948467Z initial apicid : 107 2025-05-07T19:42:58.3948551Z fpu : yes 2025-05-07T19:42:58.3948668Z fpu_exception : yes 2025-05-07T19:42:58.3948758Z cpuid level : 13 2025-05-07T19:42:58.3948839Z wp : yes 2025-05-07T19:42:58.3950912Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3951287Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3951385Z bogomips : 5999.98 2025-05-07T19:42:58.3951511Z clflush size : 64 2025-05-07T19:42:58.3951616Z cache_alignment : 64 2025-05-07T19:42:58.3951756Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3951882Z power management: 2025-05-07T19:42:58.3951886Z 2025-05-07T19:42:58.3951960Z processor : 94 2025-05-07T19:42:58.3952054Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3952138Z cpu family : 6 2025-05-07T19:42:58.3952243Z model : 85 2025-05-07T19:42:58.3952395Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3952533Z stepping : 7 2025-05-07T19:42:58.3952654Z microcode : 0x5003901 2025-05-07T19:42:58.3952740Z cpu MHz : 2999.994 2025-05-07T19:42:58.3952824Z cache size : 36608 KB 2025-05-07T19:42:58.3953073Z physical id : 1 2025-05-07T19:42:58.3953197Z siblings : 48 2025-05-07T19:42:58.3953291Z core id : 22 2025-05-07T19:42:58.3953382Z cpu cores : 24 2025-05-07T19:42:58.3953509Z apicid : 109 2025-05-07T19:42:58.3953612Z initial apicid : 109 2025-05-07T19:42:58.3953838Z fpu : yes 2025-05-07T19:42:58.3953935Z fpu_exception : yes 2025-05-07T19:42:58.3954051Z cpuid level : 13 2025-05-07T19:42:58.3954137Z wp : yes 2025-05-07T19:42:58.3956366Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3956798Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3956897Z bogomips : 5999.98 2025-05-07T19:42:58.3956986Z clflush size : 64 2025-05-07T19:42:58.3957105Z cache_alignment : 64 2025-05-07T19:42:58.3957247Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3957344Z power management: 2025-05-07T19:42:58.3957348Z 2025-05-07T19:42:58.3957464Z processor : 95 2025-05-07T19:42:58.3957566Z vendor_id : GenuineIntel 2025-05-07T19:42:58.3957656Z cpu family : 6 2025-05-07T19:42:58.3957740Z model : 85 2025-05-07T19:42:58.3957918Z model name : Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:42:58.3957998Z stepping : 7 2025-05-07T19:42:58.3958087Z microcode : 0x5003901 2025-05-07T19:42:58.3958187Z cpu MHz : 1200.562 2025-05-07T19:42:58.3958276Z cache size : 36608 KB 2025-05-07T19:42:58.3958370Z physical id : 1 2025-05-07T19:42:58.3958458Z siblings : 48 2025-05-07T19:42:58.3958552Z core id : 23 2025-05-07T19:42:58.3958643Z cpu cores : 24 2025-05-07T19:42:58.3958734Z apicid : 111 2025-05-07T19:42:58.3958908Z initial apicid : 111 2025-05-07T19:42:58.3958998Z fpu : yes 2025-05-07T19:42:58.3959099Z fpu_exception : yes 2025-05-07T19:42:58.3959195Z cpuid level : 13 2025-05-07T19:42:58.3959302Z wp : yes 2025-05-07T19:42:58.3961507Z flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:42:58.3961927Z bugs : cpu_meltdown spectre_v1 spectre_v2 spec_store_bypass l1tf mds swapgs itlb_multihit mmio_stale_data retbleed gds bhi 2025-05-07T19:42:58.3962023Z bogomips : 5999.98 2025-05-07T19:42:58.3962113Z clflush size : 64 2025-05-07T19:42:58.3962212Z cache_alignment : 64 2025-05-07T19:42:58.3962382Z address sizes : 46 bits physical, 48 bits virtual 2025-05-07T19:42:58.3962473Z power management: 2025-05-07T19:42:58.3962477Z 2025-05-07T19:42:58.3962481Z 2025-05-07T19:42:58.3962609Z ################################################################################ 2025-05-07T19:42:58.3962739Z [INFO] Print PCI info ... 2025-05-07T19:42:58.3962830Z + lspci -v 2025-05-07T19:42:58.3962834Z 2025-05-07T19:42:58.3963021Z 00:00.0 Host bridge: Intel Corporation 440FX - 82441FX PMC [Natoma] 2025-05-07T19:42:58.3963144Z Subsystem: Amazon.com, Inc. Device 1237 2025-05-07T19:42:58.3963264Z Flags: bus master, medium devsel, latency 0 2025-05-07T19:42:58.3963269Z 2025-05-07T19:42:58.3963486Z 00:01.0 ISA bridge: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] 2025-05-07T19:42:58.3963585Z Physical Slot: 1 2025-05-07T19:42:58.3963712Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:42:58.3963717Z 2025-05-07T19:42:58.3963980Z 00:01.3 Non-VGA unclassified device: Intel Corporation 82371AB/EB/MB PIIX4 ACPI (rev 08) 2025-05-07T19:42:58.3964131Z Physical Slot: 1 2025-05-07T19:42:58.3964266Z Flags: bus master, fast devsel, latency 0, IRQ 9 2025-05-07T19:42:58.3964271Z 2025-05-07T19:42:58.3964543Z 00:03.0 VGA compatible controller: Amazon.com, Inc. Device 1111 (prog-if 00 [VGA controller]) 2025-05-07T19:42:58.3964622Z Physical Slot: 3 2025-05-07T19:42:58.3964747Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:42:58.3964879Z Memory at c0000000 (32-bit, prefetchable) [size=4M] 2025-05-07T19:42:58.3965007Z Expansion ROM at 000c0000 [disabled] [size=128K] 2025-05-07T19:42:58.3965011Z 2025-05-07T19:42:58.3965468Z 00:04.0 Non-Volatile memory controller: Amazon.com, Inc. NVMe EBS Controller (prog-if 02 [NVM Express]) 2025-05-07T19:42:58.3965570Z Subsystem: Amazon.com, Inc. Device 0000 2025-05-07T19:42:58.3965644Z Physical Slot: 4 2025-05-07T19:42:58.3965767Z Flags: bus master, fast devsel, latency 0, IRQ 11 2025-05-07T19:42:58.3965935Z Memory at c0514000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:42:58.3966045Z Capabilities: 2025-05-07T19:42:58.3966129Z Kernel driver in use: nvme 2025-05-07T19:42:58.3966133Z 2025-05-07T19:42:58.3966366Z 00:05.0 Ethernet controller: Amazon.com, Inc. Elastic Network Adapter (ENA) 2025-05-07T19:42:58.3966454Z Physical Slot: 5 2025-05-07T19:42:58.3966570Z Flags: bus master, fast devsel, latency 0 2025-05-07T19:42:58.3966743Z Memory at c0510000 (32-bit, non-prefetchable) [size=16K] 2025-05-07T19:42:58.3966871Z Memory at c0400000 (32-bit, prefetchable) [size=1M] 2025-05-07T19:42:58.3967021Z Memory at c0500000 (32-bit, non-prefetchable) [size=64K] 2025-05-07T19:42:58.3967119Z Capabilities: 2025-05-07T19:42:58.3967239Z Kernel driver in use: ena 2025-05-07T19:42:58.3967244Z 2025-05-07T19:42:58.3967247Z 2025-05-07T19:42:58.3967396Z ################################################################################ 2025-05-07T19:42:58.3967504Z [INFO] Print Linux distribution info ... 2025-05-07T19:42:58.3967588Z + uname -a 2025-05-07T19:42:58.3967593Z 2025-05-07T19:42:58.3967964Z Linux 9d6bffcb154a 6.1.130-139.222.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-05-07T19:42:58.3967969Z 2025-05-07T19:42:58.3968040Z + uname -m 2025-05-07T19:42:58.3968044Z 2025-05-07T19:42:58.3968150Z x86_64 2025-05-07T19:42:58.3968155Z 2025-05-07T19:42:58.3968236Z + cat /proc/version 2025-05-07T19:42:58.3968240Z 2025-05-07T19:42:58.3968810Z Linux version 6.1.130-139.222.amzn2023.x86_64 (mockbuild@ip-10-0-55-76) (gcc (GCC) 11.5.0 20240719 (Red Hat 11.5.0-5), GNU ld version 2.39-6.amzn2023.0.11) #1 SMP PREEMPT_DYNAMIC Tue Mar 11 01:10:58 UTC 2025 2025-05-07T19:42:58.3968841Z 2025-05-07T19:42:58.3968931Z + cat /etc/os-release 2025-05-07T19:42:58.3968936Z 2025-05-07T19:42:58.3969024Z NAME="Amazon Linux" 2025-05-07T19:42:58.3969097Z VERSION="2023" 2025-05-07T19:42:58.3969197Z ID="amzn" 2025-05-07T19:42:58.3969276Z ID_LIKE="fedora" 2025-05-07T19:42:58.3969351Z VERSION_ID="2023" 2025-05-07T19:42:58.3969477Z PLATFORM_ID="platform:al2023" 2025-05-07T19:42:58.3969586Z PRETTY_NAME="Amazon Linux 2023.7.20250428" 2025-05-07T19:42:58.3969670Z ANSI_COLOR="0;33" 2025-05-07T19:42:58.3969779Z CPE_NAME="cpe:2.3:o:amazon:amazon_linux:2023" 2025-05-07T19:42:58.3969976Z HOME_URL="https://aws.amazon.com/linux/amazon-linux-2023/" 2025-05-07T19:42:58.3970141Z DOCUMENTATION_URL="https://docs.aws.amazon.com/linux/" 2025-05-07T19:42:58.3970308Z SUPPORT_URL="https://aws.amazon.com/premiumsupport/" 2025-05-07T19:42:58.3970521Z BUG_REPORT_URL="https://github.com/amazonlinux/amazon-linux-2023" 2025-05-07T19:42:58.3970605Z VENDOR_NAME="AWS" 2025-05-07T19:42:58.3970717Z VENDOR_URL="https://aws.amazon.com/" 2025-05-07T19:42:58.3970815Z SUPPORT_END="2029-06-30" 2025-05-07T19:42:58.3970841Z 2025-05-07T19:42:58.4009160Z ##[group]Run . $PRELUDE; print_gpu_info 2025-05-07T19:42:58.4009325Z . $PRELUDE; print_gpu_info 2025-05-07T19:42:58.4009631Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:42:58.4009789Z env: 2025-05-07T19:42:58.4009900Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:42:58.4009993Z BUILD_ENV: build_binary 2025-05-07T19:42:58.4010076Z BUILD_TARGET: default 2025-05-07T19:42:58.4010154Z BUILD_VARIANT: cuda 2025-05-07T19:42:58.4010246Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:42:58.4010321Z ##[endgroup] 2025-05-07T19:42:58.8091152Z ################################################################################ 2025-05-07T19:42:58.8114419Z [INFO] Printing general display info ... 2025-05-07T19:42:58.8114911Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:42:58.9012618Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:42:58.9024553Z /usr/bin/sudo 2025-05-07T19:42:58.9034600Z which: no apt-get in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:42:58.9045126Z /usr/bin/yum 2025-05-07T19:42:58.9045538Z [INSTALL] Updating system repositories ... 2025-05-07T19:42:58.9070142Z [EXEC] [ATTEMPT 0/3] + sudo yum update -y 2025-05-07T19:42:59.1278308Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:41 2025. 2025-05-07T19:42:59.2251262Z Dependencies resolved. 2025-05-07T19:42:59.2466222Z Nothing to do. 2025-05-07T19:42:59.2466921Z Complete! 2025-05-07T19:42:59.3082241Z [INSTALL] Installing system package(s): hostname lshw ... 2025-05-07T19:42:59.3105665Z [EXEC] [ATTEMPT 0/3] + sudo yum install -y hostname lshw 2025-05-07T19:42:59.5246852Z Last metadata expiration check: 0:00:18 ago on Wed May 7 19:42:41 2025. 2025-05-07T19:42:59.5771522Z Dependencies resolved. 2025-05-07T19:42:59.5935303Z ================================================================================ 2025-05-07T19:42:59.5936051Z Package Arch Version Repository Size 2025-05-07T19:42:59.5936581Z ================================================================================ 2025-05-07T19:42:59.5936918Z Installing: 2025-05-07T19:42:59.5937347Z hostname x86_64 3.23-4.amzn2023.0.3 amazonlinux 28 k 2025-05-07T19:42:59.5937841Z lshw x86_64 B.02.19.2-7.amzn2023.0.3 amazonlinux 319 k 2025-05-07T19:42:59.5938202Z 2025-05-07T19:42:59.5938301Z Transaction Summary 2025-05-07T19:42:59.5938564Z ================================================================================ 2025-05-07T19:42:59.5938916Z Install 2 Packages 2025-05-07T19:42:59.5939060Z 2025-05-07T19:42:59.5939190Z Total download size: 347 k 2025-05-07T19:42:59.5939467Z Installed size: 883 k 2025-05-07T19:42:59.5939743Z Downloading Packages: 2025-05-07T19:42:59.8781372Z (1/2): hostname-3.23-4.amzn2023.0.3.x86_64.rpm 1.4 MB/s | 28 kB 00:00 2025-05-07T19:42:59.8820349Z (2/2): lshw-B.02.19.2-7.amzn2023.0.3.x86_64.rpm 13 MB/s | 319 kB 00:00 2025-05-07T19:42:59.8827771Z -------------------------------------------------------------------------------- 2025-05-07T19:42:59.8828442Z Total 1.2 MB/s | 347 kB 00:00 2025-05-07T19:42:59.9054465Z Running transaction check 2025-05-07T19:42:59.9107171Z Transaction check succeeded. 2025-05-07T19:42:59.9108047Z Running transaction test 2025-05-07T19:42:59.9264956Z Transaction test succeeded. 2025-05-07T19:42:59.9265837Z Running transaction 2025-05-07T19:42:59.9543441Z Preparing : 1/1 2025-05-07T19:42:59.9618702Z Installing : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 1/2 2025-05-07T19:42:59.9652683Z Installing : hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:01.0066408Z Running scriptlet: hostname-3.23-4.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:01.0068799Z Verifying : hostname-3.23-4.amzn2023.0.3.x86_64 1/2 2025-05-07T19:43:01.0435949Z Verifying : lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2/2 2025-05-07T19:43:01.0437010Z 2025-05-07T19:43:01.0437718Z Installed: 2025-05-07T19:43:01.0438778Z hostname-3.23-4.amzn2023.0.3.x86_64 lshw-B.02.19.2-7.amzn2023.0.3.x86_64 2025-05-07T19:43:01.0439793Z 2025-05-07T19:43:01.0440034Z Complete! 2025-05-07T19:43:01.0781980Z + hostname 2025-05-07T19:43:01.0782372Z 2025-05-07T19:43:01.0791684Z 9d6bffcb154a 2025-05-07T19:43:01.0792164Z 2025-05-07T19:43:01.0792679Z + sudo lshw -C display 2025-05-07T19:43:01.0793172Z 2025-05-07T19:43:01.2760438Z *-display UNCLAIMED 2025-05-07T19:43:01.2761346Z description: VGA compatible controller 2025-05-07T19:43:01.2762361Z product: Amazon.com, Inc. 2025-05-07T19:43:01.2763185Z vendor: Amazon.com, Inc. 2025-05-07T19:43:01.2763974Z physical id: 3 2025-05-07T19:43:01.2764670Z bus info: pci@0000:00:03.0 2025-05-07T19:43:01.2764978Z version: 00 2025-05-07T19:43:01.2765228Z width: 32 bits 2025-05-07T19:43:01.2765494Z clock: 33MHz 2025-05-07T19:43:01.2765791Z capabilities: vga_controller bus_master 2025-05-07T19:43:01.2766134Z configuration: latency=0 2025-05-07T19:43:01.2766503Z resources: memory:c0000000-c03fffff memory:c0000-dffff 2025-05-07T19:43:01.2783371Z 2025-05-07T19:43:01.2783967Z ################################################################################ 2025-05-07T19:43:01.2785098Z [INFO] Printing NVIDIA GPU info ... 2025-05-07T19:43:01.2890932Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:01.2914617Z which: no nvidia-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:01.2916107Z [CHECK] nvidia-smi not found 2025-05-07T19:43:01.2916978Z ################################################################################ 2025-05-07T19:43:01.2917988Z [INFO] Printing AMD GPU info ... 2025-05-07T19:43:01.3021843Z lspci: Unable to load libkmod resources: error -2 2025-05-07T19:43:01.3042487Z which: no rocminfo in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:01.3043916Z [CHECK] rocminfo not found 2025-05-07T19:43:01.3048827Z which: no rocm-smi in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:43:01.3049527Z [CHECK] rocm-smi not found 2025-05-07T19:43:01.3109926Z ##[group]Run . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:01.3110408Z . $PRELUDE; setup_miniconda $HOME/miniconda 2025-05-07T19:43:01.3110963Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:01.3111306Z env: 2025-05-07T19:43:01.3111530Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:01.3111847Z BUILD_ENV: build_binary 2025-05-07T19:43:01.3112088Z BUILD_TARGET: default 2025-05-07T19:43:01.3112329Z BUILD_VARIANT: cuda 2025-05-07T19:43:01.3112677Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:43:01.3113094Z ##[endgroup] 2025-05-07T19:43:01.7247318Z ################################################################################ 2025-05-07T19:43:01.7248415Z # Setup Miniconda 2025-05-07T19:43:01.7249046Z # 2025-05-07T19:43:01.7265792Z # [2025-05-07T19:43:01.725Z] + setup_miniconda /github/home/miniconda 2025-05-07T19:43:01.7267147Z ################################################################################ 2025-05-07T19:43:01.7267951Z 2025-05-07T19:43:01.7280703Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:01.8188160Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:01.8189268Z + mkdir -p /github/home/miniconda 2025-05-07T19:43:01.8189847Z 2025-05-07T19:43:01.8207165Z 2025-05-07T19:43:01.8207646Z [SETUP] Downloading the Miniconda installer ... 2025-05-07T19:43:01.8228733Z [EXEC] [ATTEMPT 0/3] + wget -q https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O miniconda.sh 2025-05-07T19:43:02.8585739Z [SETUP] Installing Miniconda ... 2025-05-07T19:43:02.8586834Z + bash miniconda.sh -b -p /github/home/miniconda -u 2025-05-07T19:43:02.8587628Z 2025-05-07T19:43:02.8721818Z PREFIX=/github/home/miniconda 2025-05-07T19:43:03.2297539Z Unpacking payload ... 2025-05-07T19:43:03.7076373Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:04.3793080Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:06.2330105Z 2025-05-07T19:43:06.2330660Z Installing base environment... 2025-05-07T19:43:06.2330969Z 2025-05-07T19:43:07.2239819Z Preparing transaction: ...working... done 2025-05-07T19:43:10.0695333Z Executing transaction: ...working... done 2025-05-07T19:43:10.6177772Z entry_point.py:256: DeprecationWarning: Python 3.14 will, by default, filter extracted tar archives and reject files or modify their metadata. Use the filter argument to control this behavior. 2025-05-07T19:43:10.6858783Z installation finished. 2025-05-07T19:43:10.6866567Z 2025-05-07T19:43:10.6867022Z + rm -f miniconda.sh 2025-05-07T19:43:10.6867267Z 2025-05-07T19:43:10.7053929Z 2025-05-07T19:43:10.7054351Z [SETUP] Reloading the bash configuration ... 2025-05-07T19:43:10.7054790Z + /github/home/miniconda/bin/conda init bash 2025-05-07T19:43:10.7055017Z 2025-05-07T19:43:11.0635781Z no change /github/home/miniconda/condabin/conda 2025-05-07T19:43:11.0636991Z no change /github/home/miniconda/bin/conda 2025-05-07T19:43:11.0638099Z no change /github/home/miniconda/bin/conda-env 2025-05-07T19:43:11.0639193Z no change /github/home/miniconda/bin/activate 2025-05-07T19:43:11.0640297Z no change /github/home/miniconda/bin/deactivate 2025-05-07T19:43:11.0641498Z no change /github/home/miniconda/etc/profile.d/conda.sh 2025-05-07T19:43:11.0642806Z no change /github/home/miniconda/etc/fish/conf.d/conda.fish 2025-05-07T19:43:11.0643584Z no change /github/home/miniconda/shell/condabin/Conda.psm1 2025-05-07T19:43:11.0644032Z no change /github/home/miniconda/shell/condabin/conda-hook.ps1 2025-05-07T19:43:11.0644598Z no change /github/home/miniconda/lib/python3.13/site-packages/xontrib/conda.xsh 2025-05-07T19:43:11.0645621Z no change /github/home/miniconda/etc/profile.d/conda.csh 2025-05-07T19:43:11.0646136Z modified /github/home/.bashrc 2025-05-07T19:43:11.0646323Z 2025-05-07T19:43:11.0646550Z ==> For changes to take effect, close and re-open your current shell. <== 2025-05-07T19:43:11.0646854Z 2025-05-07T19:43:11.1158417Z 2025-05-07T19:43:11.1158884Z + . /github/home/.bashrc 2025-05-07T19:43:11.1159434Z 2025-05-07T19:43:11.9132459Z 2025-05-07T19:43:11.9133464Z [SETUP] Installing libmamba-solver (required since Anaconda 2024.02-1) and libarchive ... 2025-05-07T19:43:11.9157146Z [EXEC] [ATTEMPT 0/3] + conda install --solver=classic -c conda-forge --override-channels -y conda-libmamba-solver libmamba libmambapy libarchive 2025-05-07T19:43:23.7844933Z Collecting package metadata (current_repodata.json): - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:43:25.2395961Z Solving environment: | / - \ | / - \ | / - done 2025-05-07T19:43:25.3285403Z 2025-05-07T19:43:25.3285735Z ## Package Plan ## 2025-05-07T19:43:25.3286421Z 2025-05-07T19:43:25.3287055Z environment location: /github/home/miniconda 2025-05-07T19:43:25.3287388Z 2025-05-07T19:43:25.3287528Z added / updated specs: 2025-05-07T19:43:25.3287841Z - conda-libmamba-solver 2025-05-07T19:43:25.3288119Z - libarchive 2025-05-07T19:43:25.3288358Z - libmamba 2025-05-07T19:43:25.3288572Z - libmambapy 2025-05-07T19:43:25.3288725Z 2025-05-07T19:43:25.3288730Z 2025-05-07T19:43:25.3288863Z The following packages will be downloaded: 2025-05-07T19:43:25.3289098Z 2025-05-07T19:43:25.3289697Z package | build 2025-05-07T19:43:25.3290059Z ---------------------------|----------------- 2025-05-07T19:43:25.3290541Z ca-certificates-2025.4.26 | hbd8a1cb_0 149 KB conda-forge 2025-05-07T19:43:25.3291051Z certifi-2025.4.26 | pyhd8ed1ab_0 154 KB conda-forge 2025-05-07T19:43:25.3291518Z conda-25.3.1 | py313h78bf25f_1 1.1 MB conda-forge 2025-05-07T19:43:25.3292018Z conda-libmamba-solver-25.4.0| pyhd8ed1ab_0 41 KB conda-forge 2025-05-07T19:43:25.3292502Z ------------------------------------------------------------ 2025-05-07T19:43:25.3292874Z Total: 1.4 MB 2025-05-07T19:43:25.3293100Z 2025-05-07T19:43:25.3293218Z The following packages will be UPDATED: 2025-05-07T19:43:25.3293436Z 2025-05-07T19:43:25.3301595Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:43:25.3302481Z conda pkgs/main::conda-25.3.1-py313h06a4308~ --> conda-forge::conda-25.3.1-py313h78bf25f_1 2025-05-07T19:43:25.3302929Z 2025-05-07T19:43:25.3303175Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:43:25.3303520Z 2025-05-07T19:43:25.3303884Z certifi pkgs/main/linux-64::certifi-2025.4.26~ --> conda-forge/noarch::certifi-2025.4.26-pyhd8ed1ab_0 2025-05-07T19:43:25.3304742Z conda-libmamba-so~ pkgs/main::conda-libmamba-solver-25.4~ --> conda-forge::conda-libmamba-solver-25.4.0-pyhd8ed1ab_0 2025-05-07T19:43:25.3305285Z 2025-05-07T19:43:25.3305289Z 2025-05-07T19:43:25.3305292Z 2025-05-07T19:43:25.3305443Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:25.3305844Z conda-25.3.1 | 1.1 MB | | 0% 2025-05-07T19:43:25.3306098Z 2025-05-07T19:43:25.3306559Z certifi-2025.4.26 | 154 KB | | 0%  2025-05-07T19:43:25.3306830Z 2025-05-07T19:43:25.3306840Z 2025-05-07T19:43:25.3314086Z ca-certificates-2025 | 149 KB | | 0%  2025-05-07T19:43:25.3314370Z 2025-05-07T19:43:25.3314374Z 2025-05-07T19:43:25.3314597Z 2025-05-07T19:43:25.3951641Z conda-libmamba-solve | 41 KB | | 0%  2025-05-07T19:43:25.3952805Z 2025-05-07T19:43:25.3952828Z 2025-05-07T19:43:25.4032955Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:25.4033278Z 2025-05-07T19:43:25.4033282Z 2025-05-07T19:43:25.4077990Z ca-certificates-2025 | 149 KB | ########## | 100%  2025-05-07T19:43:25.4078350Z 2025-05-07T19:43:25.4078356Z 2025-05-07T19:43:25.4078361Z 2025-05-07T19:43:25.4183992Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:25.4184319Z 2025-05-07T19:43:25.4184324Z 2025-05-07T19:43:25.4184428Z 2025-05-07T19:43:25.4295709Z conda-libmamba-solve | 41 KB | ########## | 100%  2025-05-07T19:43:25.4296177Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:25.4412673Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:25.4413091Z 2025-05-07T19:43:25.4460314Z certifi-2025.4.26 | 154 KB | # | 10%  2025-05-07T19:43:25.4460726Z 2025-05-07T19:43:25.4596286Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:25.4596589Z 2025-05-07T19:43:25.5329299Z certifi-2025.4.26 | 154 KB | ########## | 100%  2025-05-07T19:43:25.5331703Z conda-25.3.1 | 1.1 MB | ########## | 100% 2025-05-07T19:43:25.5332052Z 2025-05-07T19:43:25.5332292Z 2025-05-07T19:43:25.5332478Z  2025-05-07T19:43:25.5332695Z 2025-05-07T19:43:25.5332699Z 2025-05-07T19:43:25.5332871Z  2025-05-07T19:43:25.5333109Z 2025-05-07T19:43:25.5333113Z 2025-05-07T19:43:25.5333117Z 2025-05-07T19:43:25.5334839Z  done 2025-05-07T19:43:25.6342375Z Preparing transaction: | done 2025-05-07T19:43:25.7347340Z Verifying transaction: - done 2025-05-07T19:43:27.0372800Z Executing transaction: | / - \ | / - \ | / - \ | done 2025-05-07T19:43:28.5957000Z [SETUP] Updating Miniconda base packages ... 2025-05-07T19:43:28.5984287Z [EXEC] [ATTEMPT 0/3] + conda update -n base -c defaults --update-deps -y conda 2025-05-07T19:43:29.3233935Z Channels: 2025-05-07T19:43:29.3234442Z - defaults 2025-05-07T19:43:29.3234695Z Platform: linux-64 2025-05-07T19:43:30.3989693Z Collecting package metadata (repodata.json): - \ | / - \ done 2025-05-07T19:43:30.5275384Z Solving environment: / - Channels: 2025-05-07T19:43:30.5275778Z - defaults 2025-05-07T19:43:30.5276021Z Platform: linux-64 2025-05-07T19:43:30.8060132Z Collecting package metadata (repodata.json): | / - \ done 2025-05-07T19:43:31.0142595Z Solving environment: / - \ done 2025-05-07T19:43:31.1296573Z | done 2025-05-07T19:43:31.1923576Z 2025-05-07T19:43:31.1924418Z ## Package Plan ## 2025-05-07T19:43:31.1924667Z 2025-05-07T19:43:31.1924879Z environment location: /github/home/miniconda 2025-05-07T19:43:31.1925197Z 2025-05-07T19:43:31.1925332Z added / updated specs: 2025-05-07T19:43:31.1925636Z - conda 2025-05-07T19:43:31.1925809Z 2025-05-07T19:43:31.1925814Z 2025-05-07T19:43:31.1925959Z The following packages will be downloaded: 2025-05-07T19:43:31.1926212Z 2025-05-07T19:43:31.1926375Z package | build 2025-05-07T19:43:31.1926740Z ---------------------------|----------------- 2025-05-07T19:43:31.1927153Z pip-25.1 | pyhc872135_2 1.3 MB 2025-05-07T19:43:31.1927586Z tzdata-2025b | h04d1e81_0 116 KB 2025-05-07T19:43:31.1928037Z ------------------------------------------------------------ 2025-05-07T19:43:31.1928420Z Total: 1.4 MB 2025-05-07T19:43:31.1928706Z 2025-05-07T19:43:31.1928822Z The following packages will be UPDATED: 2025-05-07T19:43:31.1929046Z 2025-05-07T19:43:31.1929725Z pip pkgs/main/linux-64::pip-25.0-py313h06~ --> pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:31.1930289Z tzdata 2025a-h04d1e81_0 --> 2025b-h04d1e81_0 2025-05-07T19:43:31.1930576Z 2025-05-07T19:43:31.1930580Z 2025-05-07T19:43:31.1930584Z 2025-05-07T19:43:31.1930746Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:31.1931135Z pip-25.1 | 1.3 MB | | 0% 2025-05-07T19:43:31.1931548Z 2025-05-07T19:43:31.2281696Z tzdata-2025b | 116 KB | | 0%  2025-05-07T19:43:31.2346062Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:31.2346395Z 2025-05-07T19:43:31.4271524Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:31.4272207Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:31.4343320Z pip-25.1 | 1.3 MB | ########## | 100% 2025-05-07T19:43:31.4345828Z 2025-05-07T19:43:31.4346311Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:31.4346591Z 2025-05-07T19:43:31.4349110Z tzdata-2025b | 116 KB | ########## | 100%  2025-05-07T19:43:31.4349670Z 2025-05-07T19:43:31.4349919Z 2025-05-07T19:43:31.4350103Z  done 2025-05-07T19:43:31.5357237Z Preparing transaction: - done 2025-05-07T19:43:31.6367814Z Verifying transaction: | done 2025-05-07T19:43:33.6404748Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:43:34.1796079Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:43:34.1797343Z + conda clean --packages --tarball -y 2025-05-07T19:43:34.1797966Z 2025-05-07T19:43:34.6183666Z Will remove 99 (117.8 MB) tarball(s). 2025-05-07T19:43:34.6184901Z Will remove 11 (16.0 MB) package(s). 2025-05-07T19:43:34.6749298Z 2025-05-07T19:43:34.6753463Z + conda clean --all -y 2025-05-07T19:43:34.6754038Z 2025-05-07T19:43:35.1185880Z There are no unused tarball(s) to remove. 2025-05-07T19:43:35.1186278Z Will remove 1 index cache(s). 2025-05-07T19:43:35.1186579Z There are no unused package(s) to remove. 2025-05-07T19:43:35.1186930Z There are no tempfile(s) to remove. 2025-05-07T19:43:35.1187231Z There are no logfile(s) to remove. 2025-05-07T19:43:35.1718152Z 2025-05-07T19:43:35.1718970Z + conda info 2025-05-07T19:43:35.1719215Z 2025-05-07T19:43:35.7361323Z 2025-05-07T19:43:35.7361838Z active environment : base 2025-05-07T19:43:35.7362570Z active env location : /github/home/miniconda 2025-05-07T19:43:35.7362934Z shell level : 1 2025-05-07T19:43:35.7363267Z user config file : /github/home/.condarc 2025-05-07T19:43:35.7363686Z populated config files : /github/home/miniconda/.condarc 2025-05-07T19:43:35.7364104Z conda version : 25.3.1 2025-05-07T19:43:35.7364429Z conda-build version : not installed 2025-05-07T19:43:35.7364775Z python version : 3.13.2.final.0 2025-05-07T19:43:35.7365139Z solver : libmamba (default) 2025-05-07T19:43:35.7365481Z virtual packages : __archspec=1=cascadelake 2025-05-07T19:43:35.7365847Z __conda=25.3.1=0 2025-05-07T19:43:35.7366161Z __glibc=2.34=0 2025-05-07T19:43:35.7366496Z __linux=6.1.130=0 2025-05-07T19:43:35.7366801Z __unix=0=0 2025-05-07T19:43:35.7367183Z base environment : /github/home/miniconda (writable) 2025-05-07T19:43:35.7367608Z conda av data dir : /github/home/miniconda/etc/conda 2025-05-07T19:43:35.7368004Z conda av metadata url : None 2025-05-07T19:43:35.7368427Z channel URLs : https://repo.anaconda.com/pkgs/main/linux-64 2025-05-07T19:43:35.7368882Z https://repo.anaconda.com/pkgs/main/noarch 2025-05-07T19:43:35.7369331Z https://repo.anaconda.com/pkgs/r/linux-64 2025-05-07T19:43:35.7370117Z https://repo.anaconda.com/pkgs/r/noarch 2025-05-07T19:43:35.7370526Z package cache : /github/home/miniconda/pkgs 2025-05-07T19:43:35.7370867Z /github/home/.conda/pkgs 2025-05-07T19:43:35.7371243Z envs directories : /github/home/miniconda/envs 2025-05-07T19:43:35.7371615Z /github/home/.conda/envs 2025-05-07T19:43:35.7371924Z platform : linux-64 2025-05-07T19:43:35.7372811Z user-agent : conda/25.3.1 requests/2.32.3 CPython/3.13.2 Linux/6.1.130-139.222.amzn2023.x86_64 amzn/2023.7.20250428 glibc/2.34 solver/libmamba conda-libmamba-solver/25.4.0 libmambapy/2.0.5 aau/0.7.0 c/. s/. e/. 2025-05-07T19:43:35.7373673Z UID:GID : 0:0 2025-05-07T19:43:35.7373960Z netrc file : None 2025-05-07T19:43:35.7374227Z offline mode : False 2025-05-07T19:43:35.7374439Z 2025-05-07T19:43:35.7959561Z 2025-05-07T19:43:35.7959902Z [SETUP] Exporting Miniconda variables ... 2025-05-07T19:43:35.7960797Z [SETUP] Saving Miniconda variables to /__w/_temp/_runner_file_commands/add_path_aecda48c-53cc-4ff6-a9be-63314883f3a0 ... 2025-05-07T19:43:35.7961499Z [SETUP] Successfully set up Miniconda at /github/home/miniconda 2025-05-07T19:43:35.8116608Z ##[group]Run . $PRELUDE; create_conda_environment $BUILD_ENV 3.10 2025-05-07T19:43:35.8117151Z . $PRELUDE; create_conda_environment $BUILD_ENV 3.10 2025-05-07T19:43:35.8117862Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:43:35.8118213Z env: 2025-05-07T19:43:35.8118459Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:43:35.8118772Z BUILD_ENV: build_binary 2025-05-07T19:43:35.8119038Z BUILD_TARGET: default 2025-05-07T19:43:35.8119274Z BUILD_VARIANT: cuda 2025-05-07T19:43:35.8119526Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:43:35.8119779Z ##[endgroup] 2025-05-07T19:43:36.2523621Z ################################################################################ 2025-05-07T19:43:36.2524337Z # Create Conda Environment 2025-05-07T19:43:36.2524638Z # 2025-05-07T19:43:36.2545975Z # [2025-05-07T19:43:36.253Z] + create_conda_environment build_binary 3.10 2025-05-07T19:43:36.2547011Z ################################################################################ 2025-05-07T19:43:36.2547275Z 2025-05-07T19:43:36.2560887Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:43:36.3436153Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:43:36.3436619Z [SETUP] Listing existing Conda environments ... 2025-05-07T19:43:36.3436999Z + conda info --envs 2025-05-07T19:43:36.3437156Z 2025-05-07T19:43:36.9073203Z 2025-05-07T19:43:36.9073744Z # conda environments: 2025-05-07T19:43:36.9074134Z # 2025-05-07T19:43:36.9074388Z base /github/home/miniconda 2025-05-07T19:43:36.9074673Z 2025-05-07T19:43:36.9655703Z 2025-05-07T19:43:36.9656299Z [SETUP] Deleting the prefix directory if it exists ... 2025-05-07T19:43:38.5481915Z + rm -rf /github/home/miniconda/envs/build_binary 2025-05-07T19:43:38.5482268Z 2025-05-07T19:43:38.5499140Z 2025-05-07T19:43:38.5514484Z [SETUP] Creating new Conda environment (Python 3.10) ... 2025-05-07T19:43:38.5535361Z [EXEC] [ATTEMPT 0/3] + conda create -y -n build_binary python=3.10 2025-05-07T19:43:39.1326068Z Channels: 2025-05-07T19:43:39.1326457Z - defaults 2025-05-07T19:43:39.1326744Z Platform: linux-64 2025-05-07T19:43:40.5347584Z Collecting package metadata (repodata.json): - \ | / - \ | / - done 2025-05-07T19:43:40.6351644Z Solving environment: | done 2025-05-07T19:43:40.6638582Z 2025-05-07T19:43:40.6638926Z ## Package Plan ## 2025-05-07T19:43:40.6639329Z 2025-05-07T19:43:40.6639848Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:43:40.6640206Z 2025-05-07T19:43:40.6640319Z added / updated specs: 2025-05-07T19:43:40.6640591Z - python=3.10 2025-05-07T19:43:40.6640825Z 2025-05-07T19:43:40.6640829Z 2025-05-07T19:43:40.6640961Z The following packages will be downloaded: 2025-05-07T19:43:40.6641191Z 2025-05-07T19:43:40.6641518Z package | build 2025-05-07T19:43:40.6641875Z ---------------------------|----------------- 2025-05-07T19:43:40.6642333Z _libgcc_mutex-0.1 | main 3 KB 2025-05-07T19:43:40.6642780Z _openmp_mutex-5.1 | 1_gnu 21 KB 2025-05-07T19:43:40.6643266Z ca-certificates-2025.2.25 | h06a4308_0 129 KB 2025-05-07T19:43:40.6643740Z python-3.10.16 | he870216_1 26.9 MB 2025-05-07T19:43:40.6644175Z setuptools-78.1.1 | py310h06a4308_0 1.7 MB 2025-05-07T19:43:40.6644628Z wheel-0.45.1 | py310h06a4308_0 115 KB 2025-05-07T19:43:40.6645032Z ------------------------------------------------------------ 2025-05-07T19:43:40.6645459Z Total: 28.8 MB 2025-05-07T19:43:40.6645707Z 2025-05-07T19:43:40.6645875Z The following NEW packages will be INSTALLED: 2025-05-07T19:43:40.6646121Z 2025-05-07T19:43:40.6646366Z _libgcc_mutex pkgs/main/linux-64::_libgcc_mutex-0.1-main 2025-05-07T19:43:40.6646874Z _openmp_mutex pkgs/main/linux-64::_openmp_mutex-5.1-1_gnu 2025-05-07T19:43:40.6648006Z bzip2 pkgs/main/linux-64::bzip2-1.0.8-h5eee18b_6 2025-05-07T19:43:40.6648561Z ca-certificates pkgs/main/linux-64::ca-certificates-2025.2.25-h06a4308_0 2025-05-07T19:43:40.6649270Z ld_impl_linux-64 pkgs/main/linux-64::ld_impl_linux-64-2.40-h12ee557_0 2025-05-07T19:43:40.6649736Z libffi pkgs/main/linux-64::libffi-3.4.4-h6a678d5_1 2025-05-07T19:43:40.6650194Z libgcc-ng pkgs/main/linux-64::libgcc-ng-11.2.0-h1234567_1 2025-05-07T19:43:40.6650637Z libgomp pkgs/main/linux-64::libgomp-11.2.0-h1234567_1 2025-05-07T19:43:40.6651130Z libstdcxx-ng pkgs/main/linux-64::libstdcxx-ng-11.2.0-h1234567_1 2025-05-07T19:43:40.6651733Z libuuid pkgs/main/linux-64::libuuid-1.41.5-h5eee18b_0 2025-05-07T19:43:40.6652189Z ncurses pkgs/main/linux-64::ncurses-6.4-h6a678d5_0 2025-05-07T19:43:40.6652635Z openssl pkgs/main/linux-64::openssl-3.0.16-h5eee18b_0 2025-05-07T19:43:40.6653050Z pip pkgs/main/noarch::pip-25.1-pyhc872135_2 2025-05-07T19:43:40.6653482Z python pkgs/main/linux-64::python-3.10.16-he870216_1 2025-05-07T19:43:40.6653915Z readline pkgs/main/linux-64::readline-8.2-h5eee18b_0 2025-05-07T19:43:40.6654415Z setuptools pkgs/main/linux-64::setuptools-78.1.1-py310h06a4308_0 2025-05-07T19:43:40.6655106Z sqlite pkgs/main/linux-64::sqlite-3.45.3-h5eee18b_0 2025-05-07T19:43:40.6655519Z tk pkgs/main/linux-64::tk-8.6.14-h39e8969_0 2025-05-07T19:43:40.6655955Z tzdata pkgs/main/noarch::tzdata-2025b-h04d1e81_0 2025-05-07T19:43:40.6656403Z wheel pkgs/main/linux-64::wheel-0.45.1-py310h06a4308_0 2025-05-07T19:43:40.6656858Z xz pkgs/main/linux-64::xz-5.6.4-h5eee18b_1 2025-05-07T19:43:40.6657279Z zlib pkgs/main/linux-64::zlib-1.2.13-h5eee18b_1 2025-05-07T19:43:40.6657540Z 2025-05-07T19:43:40.6657544Z 2025-05-07T19:43:40.6657547Z 2025-05-07T19:43:40.6657708Z Downloading and Extracting Packages: ...working... 2025-05-07T19:43:40.6658317Z python-3.10.16 | 26.9 MB | | 0% 2025-05-07T19:43:40.6658567Z 2025-05-07T19:43:40.6658912Z setuptools-78.1.1 | 1.7 MB | | 0%  2025-05-07T19:43:40.6659207Z 2025-05-07T19:43:40.6659211Z 2025-05-07T19:43:40.6684011Z ca-certificates-2025 | 129 KB | | 0%  2025-05-07T19:43:40.6684890Z 2025-05-07T19:43:40.6684950Z 2025-05-07T19:43:40.6684962Z 2025-05-07T19:43:40.6690936Z wheel-0.45.1 | 115 KB | | 0%  2025-05-07T19:43:40.6691703Z 2025-05-07T19:43:40.6691727Z 2025-05-07T19:43:40.6691775Z 2025-05-07T19:43:40.6691786Z 2025-05-07T19:43:40.6697994Z _openmp_mutex-5.1 | 21 KB | | 0%  2025-05-07T19:43:40.6698302Z 2025-05-07T19:43:40.6698305Z 2025-05-07T19:43:40.6698309Z 2025-05-07T19:43:40.6698313Z 2025-05-07T19:43:40.6698316Z 2025-05-07T19:43:40.7115540Z _libgcc_mutex-0.1 | 3 KB | | 0%  2025-05-07T19:43:40.7116460Z 2025-05-07T19:43:40.7116474Z 2025-05-07T19:43:40.7116485Z 2025-05-07T19:43:40.7116495Z 2025-05-07T19:43:40.7276214Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:40.7276538Z 2025-05-07T19:43:40.7276543Z 2025-05-07T19:43:40.7354560Z ca-certificates-2025 | 129 KB | ########## | 100%  2025-05-07T19:43:40.7354916Z 2025-05-07T19:43:40.7354921Z 2025-05-07T19:43:40.7354924Z 2025-05-07T19:43:40.7354928Z 2025-05-07T19:43:40.7354931Z 2025-05-07T19:43:40.7465871Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:40.7466223Z 2025-05-07T19:43:40.7466250Z 2025-05-07T19:43:40.7466254Z 2025-05-07T19:43:40.7501394Z wheel-0.45.1 | 115 KB | ########## | 100%  2025-05-07T19:43:40.7501703Z 2025-05-07T19:43:40.7501892Z 2025-05-07T19:43:40.7501902Z 2025-05-07T19:43:40.7501920Z 2025-05-07T19:43:40.7501923Z 2025-05-07T19:43:40.7504960Z _libgcc_mutex-0.1 | 3 KB | ########## | 100%  2025-05-07T19:43:40.7505357Z 2025-05-07T19:43:40.7505375Z 2025-05-07T19:43:40.7569962Z ca-certificates-2025 | 129 KB | ########## | 100%  2025-05-07T19:43:40.7570286Z 2025-05-07T19:43:40.7570291Z 2025-05-07T19:43:40.7570321Z 2025-05-07T19:43:40.7570325Z 2025-05-07T19:43:40.7642192Z _openmp_mutex-5.1 | 21 KB | ########## | 100%  2025-05-07T19:43:40.7715698Z python-3.10.16 | 26.9 MB | #4 | 15% 2025-05-07T19:43:40.7715986Z 2025-05-07T19:43:40.7920753Z setuptools-78.1.1 | 1.7 MB | | 1%  2025-05-07T19:43:40.7921081Z 2025-05-07T19:43:40.7921086Z 2025-05-07T19:43:40.7921328Z 2025-05-07T19:43:40.7922041Z wheel-0.45.1 | 115 KB | ########## | 100%  2025-05-07T19:43:40.7922316Z 2025-05-07T19:43:40.7922321Z 2025-05-07T19:43:40.7922330Z 2025-05-07T19:43:40.8039931Z wheel-0.45.1 | 115 KB | ########## | 100%  2025-05-07T19:43:40.8040258Z 2025-05-07T19:43:40.8642010Z setuptools-78.1.1 | 1.7 MB | ########## | 100%  2025-05-07T19:43:41.0264514Z python-3.10.16 | 26.9 MB | #######2 | 73% 2025-05-07T19:43:41.0265321Z 2025-05-07T19:43:41.0266147Z setuptools-78.1.1 | 1.7 MB | ########## | 100%  2025-05-07T19:43:41.0266966Z 2025-05-07T19:43:41.0534430Z setuptools-78.1.1 | 1.7 MB | ########## | 100%  2025-05-07T19:43:41.5601404Z python-3.10.16 | 26.9 MB | ########## | 100% 2025-05-07T19:43:41.5605893Z python-3.10.16 | 26.9 MB | ########## | 100% 2025-05-07T19:43:41.5606800Z 2025-05-07T19:43:41.5607030Z 2025-05-07T19:43:41.5607360Z  2025-05-07T19:43:41.5607625Z 2025-05-07T19:43:41.5607629Z 2025-05-07T19:43:41.5607821Z  2025-05-07T19:43:41.5608058Z 2025-05-07T19:43:41.5608063Z 2025-05-07T19:43:41.5608068Z 2025-05-07T19:43:41.5608311Z  2025-05-07T19:43:41.5608549Z 2025-05-07T19:43:41.5608553Z 2025-05-07T19:43:41.5608557Z 2025-05-07T19:43:41.5608560Z 2025-05-07T19:43:41.5608796Z  2025-05-07T19:43:41.5609051Z 2025-05-07T19:43:41.5609055Z 2025-05-07T19:43:41.5609058Z 2025-05-07T19:43:41.5609062Z 2025-05-07T19:43:41.5609065Z 2025-05-07T19:43:41.5609276Z  done 2025-05-07T19:43:41.7721396Z Preparing transaction: - \ done 2025-05-07T19:43:43.0118453Z Verifying transaction: / - \ | / - \ | / - \ | done 2025-05-07T19:43:45.1265808Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:43:45.1304813Z # 2025-05-07T19:43:45.1305537Z # To activate this environment, use 2025-05-07T19:43:45.1306392Z # 2025-05-07T19:43:45.1306946Z # $ conda activate build_binary 2025-05-07T19:43:45.1307768Z # 2025-05-07T19:43:45.1308366Z # To deactivate an active environment, use 2025-05-07T19:43:45.1309221Z # 2025-05-07T19:43:45.1309739Z # $ conda deactivate 2025-05-07T19:43:45.1310204Z 2025-05-07T19:43:45.2153809Z [SETUP] Upgrading PIP to latest ... 2025-05-07T19:43:45.2185715Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --upgrade pip 2025-05-07T19:43:48.1515476Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:43:48.1517178Z 2025-05-07T19:43:48.1517579Z Requirement already satisfied: pip in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (25.1) 2025-05-07T19:43:48.1518185Z Collecting pip 2025-05-07T19:43:48.1518893Z Downloading pip-25.1.1-py3-none-any.whl.metadata (3.6 kB) 2025-05-07T19:43:48.1519365Z Downloading pip-25.1.1-py3-none-any.whl (1.8 MB) 2025-05-07T19:43:48.1520239Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.8/1.8 MB 66.2 MB/s eta 0:00:00 2025-05-07T19:43:48.1520646Z Installing collected packages: pip 2025-05-07T19:43:48.1520985Z Attempting uninstall: pip 2025-05-07T19:43:48.1521277Z Found existing installation: pip 25.1 2025-05-07T19:43:48.1521623Z Uninstalling pip-25.1: 2025-05-07T19:43:48.1521913Z Successfully uninstalled pip-25.1 2025-05-07T19:43:48.1522256Z Successfully installed pip-25.1.1 2025-05-07T19:43:48.1522450Z 2025-05-07T19:43:48.2281665Z [SETUP] Upgrading pyOpenSSL ... 2025-05-07T19:43:48.2307681Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y pyOpenSSL>22.1.0 2025-05-07T19:43:48.8896128Z Channels: 2025-05-07T19:43:48.8897055Z - conda-forge 2025-05-07T19:43:48.8897386Z Platform: linux-64 2025-05-07T19:43:58.5720297Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / - done 2025-05-07T19:44:00.4160595Z Solving environment: | / - \ | done 2025-05-07T19:44:00.4605204Z 2025-05-07T19:44:00.4605843Z ## Package Plan ## 2025-05-07T19:44:00.4606321Z 2025-05-07T19:44:00.4606929Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:00.4607856Z 2025-05-07T19:44:00.4608167Z added / updated specs: 2025-05-07T19:44:00.4608945Z - pyopenssl[version='>22.1.0'] 2025-05-07T19:44:00.4609550Z 2025-05-07T19:44:00.4609562Z 2025-05-07T19:44:00.4609930Z The following packages will be downloaded: 2025-05-07T19:44:00.4610194Z 2025-05-07T19:44:00.4610350Z package | build 2025-05-07T19:44:00.4610742Z ---------------------------|----------------- 2025-05-07T19:44:00.4611172Z cffi-1.17.1 | py310h8deb56e_0 238 KB conda-forge 2025-05-07T19:44:00.4611722Z cryptography-44.0.3 | py310h6c63255_0 1.5 MB conda-forge 2025-05-07T19:44:00.4612233Z libgcc-15.1.0 | h767d61c_2 810 KB conda-forge 2025-05-07T19:44:00.4612718Z libgcc-ng-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:44:00.4613387Z libgomp-15.1.0 | h767d61c_2 442 KB conda-forge 2025-05-07T19:44:00.4613836Z openssl-3.5.0 | h7b32b05_1 3.0 MB conda-forge 2025-05-07T19:44:00.4614275Z pycparser-2.22 | pyh29332c3_1 108 KB conda-forge 2025-05-07T19:44:00.4614745Z pyopenssl-25.0.0 | pyhd8ed1ab_0 120 KB conda-forge 2025-05-07T19:44:00.4615189Z python_abi-3.10 | 2_cp310 4 KB conda-forge 2025-05-07T19:44:00.4615677Z typing-extensions-4.13.2 | h0e9735f_0 88 KB conda-forge 2025-05-07T19:44:00.4616194Z typing_extensions-4.13.2 | pyh29332c3_0 51 KB conda-forge 2025-05-07T19:44:00.4616636Z ------------------------------------------------------------ 2025-05-07T19:44:00.4617015Z Total: 6.3 MB 2025-05-07T19:44:00.4617233Z 2025-05-07T19:44:00.4617364Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:00.4617614Z 2025-05-07T19:44:00.4617841Z cffi conda-forge/linux-64::cffi-1.17.1-py310h8deb56e_0 2025-05-07T19:44:00.4618379Z cryptography conda-forge/linux-64::cryptography-44.0.3-py310h6c63255_0 2025-05-07T19:44:00.4618887Z libgcc conda-forge/linux-64::libgcc-15.1.0-h767d61c_2 2025-05-07T19:44:00.4619375Z pycparser conda-forge/noarch::pycparser-2.22-pyh29332c3_1 2025-05-07T19:44:00.4619869Z pyopenssl conda-forge/noarch::pyopenssl-25.0.0-pyhd8ed1ab_0 2025-05-07T19:44:00.4620547Z python_abi conda-forge/linux-64::python_abi-3.10-2_cp310 2025-05-07T19:44:00.4623632Z typing-extensions conda-forge/noarch::typing-extensions-4.13.2-h0e9735f_0 2025-05-07T19:44:00.4624490Z typing_extensions conda-forge/noarch::typing_extensions-4.13.2-pyh29332c3_0 2025-05-07T19:44:00.4624868Z 2025-05-07T19:44:00.4625027Z The following packages will be UPDATED: 2025-05-07T19:44:00.4625254Z 2025-05-07T19:44:00.4625692Z ca-certificates pkgs/main/linux-64::ca-certificates-2~ --> conda-forge/noarch::ca-certificates-2025.4.26-hbd8a1cb_0 2025-05-07T19:44:00.4626577Z libgcc-ng pkgs/main::libgcc-ng-11.2.0-h1234567_1 --> conda-forge::libgcc-ng-15.1.0-h69a702a_2 2025-05-07T19:44:00.4627320Z libgomp pkgs/main::libgomp-11.2.0-h1234567_1 --> conda-forge::libgomp-15.1.0-h767d61c_2 2025-05-07T19:44:00.4628139Z openssl pkgs/main::openssl-3.0.16-h5eee18b_0 --> conda-forge::openssl-3.5.0-h7b32b05_1 2025-05-07T19:44:00.4628607Z 2025-05-07T19:44:00.4628612Z 2025-05-07T19:44:00.4628616Z 2025-05-07T19:44:00.4628781Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:00.4629221Z openssl-3.5.0 | 3.0 MB | | 0% 2025-05-07T19:44:00.4629480Z 2025-05-07T19:44:00.4629832Z cryptography-44.0.3 | 1.5 MB | | 0%  2025-05-07T19:44:00.4630146Z 2025-05-07T19:44:00.4630149Z 2025-05-07T19:44:00.4630371Z libgcc-15.1.0 | 810 KB | | 0%  2025-05-07T19:44:00.4630628Z 2025-05-07T19:44:00.4630632Z 2025-05-07T19:44:00.4630635Z 2025-05-07T19:44:00.4656488Z libgomp-15.1.0 | 442 KB | | 0%  2025-05-07T19:44:00.4657341Z 2025-05-07T19:44:00.4657355Z 2025-05-07T19:44:00.4657381Z 2025-05-07T19:44:00.4657392Z 2025-05-07T19:44:00.4673383Z cffi-1.17.1 | 238 KB | | 0%  2025-05-07T19:44:00.4674136Z 2025-05-07T19:44:00.4674178Z 2025-05-07T19:44:00.4674182Z 2025-05-07T19:44:00.4674314Z 2025-05-07T19:44:00.4674321Z 2025-05-07T19:44:00.4675113Z pyopenssl-25.0.0 | 120 KB | | 0%  2025-05-07T19:44:00.4675449Z 2025-05-07T19:44:00.4675517Z 2025-05-07T19:44:00.4675522Z 2025-05-07T19:44:00.4675553Z 2025-05-07T19:44:00.4675557Z 2025-05-07T19:44:00.4675561Z 2025-05-07T19:44:00.4675856Z pycparser-2.22 | 108 KB | | 0%  2025-05-07T19:44:00.4676159Z 2025-05-07T19:44:00.4676163Z 2025-05-07T19:44:00.4676167Z 2025-05-07T19:44:00.4676171Z 2025-05-07T19:44:00.4676203Z 2025-05-07T19:44:00.4676208Z 2025-05-07T19:44:00.4676211Z 2025-05-07T19:44:00.4676502Z typing-extensions-4. | 88 KB | | 0%  2025-05-07T19:44:00.4676842Z 2025-05-07T19:44:00.4676847Z 2025-05-07T19:44:00.4676851Z 2025-05-07T19:44:00.4676856Z 2025-05-07T19:44:00.4676859Z 2025-05-07T19:44:00.4676873Z 2025-05-07T19:44:00.4676876Z 2025-05-07T19:44:00.4676880Z 2025-05-07T19:44:00.4677182Z typing_extensions-4. | 51 KB | | 0%  2025-05-07T19:44:00.4677506Z 2025-05-07T19:44:00.4677510Z 2025-05-07T19:44:00.4677513Z 2025-05-07T19:44:00.4677517Z 2025-05-07T19:44:00.4677520Z 2025-05-07T19:44:00.4677524Z 2025-05-07T19:44:00.4677532Z 2025-05-07T19:44:00.4677536Z 2025-05-07T19:44:00.4677539Z 2025-05-07T19:44:00.4677850Z libgcc-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:00.4678154Z 2025-05-07T19:44:00.4678158Z 2025-05-07T19:44:00.4678161Z 2025-05-07T19:44:00.4678165Z 2025-05-07T19:44:00.4678168Z 2025-05-07T19:44:00.4678172Z 2025-05-07T19:44:00.4678175Z 2025-05-07T19:44:00.4678179Z 2025-05-07T19:44:00.4678183Z 2025-05-07T19:44:00.4678211Z 2025-05-07T19:44:00.5222902Z python_abi-3.10 | 4 KB | | 0%  2025-05-07T19:44:00.5223251Z 2025-05-07T19:44:00.5223699Z 2025-05-07T19:44:00.5223715Z 2025-05-07T19:44:00.5223723Z 2025-05-07T19:44:00.5310826Z cffi-1.17.1 | 238 KB | ########## | 100%  2025-05-07T19:44:00.5311185Z 2025-05-07T19:44:00.5538473Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:00.5538816Z 2025-05-07T19:44:00.5538853Z 2025-05-07T19:44:00.5540016Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:00.5540297Z 2025-05-07T19:44:00.5540301Z 2025-05-07T19:44:00.5540315Z 2025-05-07T19:44:00.5628815Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:00.5689330Z openssl-3.5.0 | 3.0 MB | ####9 | 49% 2025-05-07T19:44:00.5689892Z 2025-05-07T19:44:00.5689902Z 2025-05-07T19:44:00.5689907Z 2025-05-07T19:44:00.5689911Z 2025-05-07T19:44:00.5689921Z 2025-05-07T19:44:00.5693953Z pyopenssl-25.0.0 | 120 KB | #3 | 13%  2025-05-07T19:44:00.5694281Z 2025-05-07T19:44:00.5694298Z 2025-05-07T19:44:00.5694573Z 2025-05-07T19:44:00.5694578Z 2025-05-07T19:44:00.5694582Z 2025-05-07T19:44:00.5694586Z 2025-05-07T19:44:00.5731107Z pycparser-2.22 | 108 KB | #4 | 15%  2025-05-07T19:44:00.5732486Z 2025-05-07T19:44:00.5732533Z 2025-05-07T19:44:00.5732544Z 2025-05-07T19:44:00.5732555Z 2025-05-07T19:44:00.5732566Z 2025-05-07T19:44:00.5732612Z 2025-05-07T19:44:00.5748234Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:00.5748557Z 2025-05-07T19:44:00.5748561Z 2025-05-07T19:44:00.5748564Z 2025-05-07T19:44:00.5748568Z 2025-05-07T19:44:00.5753366Z 2025-05-07T19:44:00.5795675Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:00.5796007Z 2025-05-07T19:44:00.5796014Z 2025-05-07T19:44:00.5796020Z 2025-05-07T19:44:00.5796028Z 2025-05-07T19:44:00.5799566Z cffi-1.17.1 | 238 KB | ########## | 100%  2025-05-07T19:44:00.5799850Z 2025-05-07T19:44:00.5799855Z 2025-05-07T19:44:00.5799904Z 2025-05-07T19:44:00.5799911Z 2025-05-07T19:44:00.5938904Z cffi-1.17.1 | 238 KB | ########## | 100%  2025-05-07T19:44:00.5939244Z 2025-05-07T19:44:00.5939343Z 2025-05-07T19:44:00.5939349Z 2025-05-07T19:44:00.5939354Z 2025-05-07T19:44:00.5939360Z 2025-05-07T19:44:00.5939365Z 2025-05-07T19:44:00.5939370Z 2025-05-07T19:44:00.5961784Z typing-extensions-4. | 88 KB | #8 | 18%  2025-05-07T19:44:00.5962805Z 2025-05-07T19:44:00.5962848Z 2025-05-07T19:44:00.5962860Z 2025-05-07T19:44:00.5962872Z 2025-05-07T19:44:00.5962884Z 2025-05-07T19:44:00.5962896Z 2025-05-07T19:44:00.5962937Z 2025-05-07T19:44:00.5995598Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:00.5995976Z 2025-05-07T19:44:00.5995983Z 2025-05-07T19:44:00.5995989Z 2025-05-07T19:44:00.5995995Z 2025-05-07T19:44:00.5995998Z 2025-05-07T19:44:00.5996004Z 2025-05-07T19:44:00.5996008Z 2025-05-07T19:44:00.5996038Z 2025-05-07T19:44:00.6025036Z typing_extensions-4. | 51 KB | ###1 | 31%  2025-05-07T19:44:00.6026162Z 2025-05-07T19:44:00.6026175Z 2025-05-07T19:44:00.6026186Z 2025-05-07T19:44:00.6026196Z 2025-05-07T19:44:00.6026207Z 2025-05-07T19:44:00.6026217Z 2025-05-07T19:44:00.6026228Z 2025-05-07T19:44:00.6026267Z 2025-05-07T19:44:00.6092489Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:00.6092853Z 2025-05-07T19:44:00.6092857Z 2025-05-07T19:44:00.6092861Z 2025-05-07T19:44:00.6092865Z 2025-05-07T19:44:00.6092995Z 2025-05-07T19:44:00.6093006Z 2025-05-07T19:44:00.6093012Z 2025-05-07T19:44:00.6093018Z 2025-05-07T19:44:00.6093023Z 2025-05-07T19:44:00.6099615Z 2025-05-07T19:44:00.6100254Z python_abi-3.10 | 4 KB | ########## | 100%  2025-05-07T19:44:00.6100583Z 2025-05-07T19:44:00.6100587Z 2025-05-07T19:44:00.6100591Z 2025-05-07T19:44:00.6100595Z 2025-05-07T19:44:00.6100615Z 2025-05-07T19:44:00.6100619Z 2025-05-07T19:44:00.6100648Z 2025-05-07T19:44:00.6100653Z 2025-05-07T19:44:00.6100658Z 2025-05-07T19:44:00.6100692Z 2025-05-07T19:44:00.6119532Z python_abi-3.10 | 4 KB | ########## | 100%  2025-05-07T19:44:00.6119845Z 2025-05-07T19:44:00.6119850Z 2025-05-07T19:44:00.6119867Z 2025-05-07T19:44:00.6119871Z 2025-05-07T19:44:00.6120248Z 2025-05-07T19:44:00.6120253Z 2025-05-07T19:44:00.6120256Z 2025-05-07T19:44:00.6120291Z 2025-05-07T19:44:00.6120296Z 2025-05-07T19:44:00.6132190Z libgcc-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:00.6132512Z 2025-05-07T19:44:00.6132530Z 2025-05-07T19:44:00.6132534Z 2025-05-07T19:44:00.6132537Z 2025-05-07T19:44:00.6132541Z 2025-05-07T19:44:00.6132544Z 2025-05-07T19:44:00.6132571Z 2025-05-07T19:44:00.6132575Z 2025-05-07T19:44:00.6132579Z 2025-05-07T19:44:00.6245476Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:00.6311423Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:00.6312924Z 2025-05-07T19:44:00.6312938Z 2025-05-07T19:44:00.6312949Z 2025-05-07T19:44:00.6313801Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:00.6314621Z 2025-05-07T19:44:00.6314632Z 2025-05-07T19:44:00.6314643Z 2025-05-07T19:44:00.6355655Z libgomp-15.1.0 | 442 KB | ########## | 100%  2025-05-07T19:44:00.6356539Z 2025-05-07T19:44:00.6356553Z 2025-05-07T19:44:00.6357308Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:00.6357688Z 2025-05-07T19:44:00.6357691Z 2025-05-07T19:44:00.6623828Z libgcc-15.1.0 | 810 KB | ########## | 100%  2025-05-07T19:44:00.6624700Z 2025-05-07T19:44:00.6624715Z 2025-05-07T19:44:00.6624726Z 2025-05-07T19:44:00.6624736Z 2025-05-07T19:44:00.6624747Z 2025-05-07T19:44:00.6733534Z pyopenssl-25.0.0 | 120 KB | ########## | 100%  2025-05-07T19:44:00.6734523Z 2025-05-07T19:44:00.6734537Z 2025-05-07T19:44:00.6734582Z 2025-05-07T19:44:00.6734594Z 2025-05-07T19:44:00.6734604Z 2025-05-07T19:44:00.6734615Z 2025-05-07T19:44:00.6734626Z 2025-05-07T19:44:00.6833667Z typing-extensions-4. | 88 KB | ########## | 100%  2025-05-07T19:44:00.6834732Z 2025-05-07T19:44:00.6834746Z 2025-05-07T19:44:00.6834758Z 2025-05-07T19:44:00.6834804Z 2025-05-07T19:44:00.6834816Z 2025-05-07T19:44:00.6834826Z 2025-05-07T19:44:00.6834837Z 2025-05-07T19:44:00.6834847Z 2025-05-07T19:44:00.6903545Z typing_extensions-4. | 51 KB | ########## | 100%  2025-05-07T19:44:00.6904597Z 2025-05-07T19:44:00.6904611Z 2025-05-07T19:44:00.6904622Z 2025-05-07T19:44:00.6904632Z 2025-05-07T19:44:00.6904642Z 2025-05-07T19:44:00.6904653Z 2025-05-07T19:44:00.6904663Z 2025-05-07T19:44:00.6904673Z 2025-05-07T19:44:00.6904684Z 2025-05-07T19:44:00.6904694Z 2025-05-07T19:44:00.7093880Z python_abi-3.10 | 4 KB | ########## | 100%  2025-05-07T19:44:00.7094249Z 2025-05-07T19:44:00.7094254Z 2025-05-07T19:44:00.7094258Z 2025-05-07T19:44:00.7094292Z 2025-05-07T19:44:00.7094296Z 2025-05-07T19:44:00.7094299Z 2025-05-07T19:44:00.7094569Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:00.7094862Z 2025-05-07T19:44:00.7094866Z 2025-05-07T19:44:00.7094869Z 2025-05-07T19:44:00.7094883Z 2025-05-07T19:44:00.7094886Z 2025-05-07T19:44:00.7094890Z 2025-05-07T19:44:00.7229481Z pycparser-2.22 | 108 KB | ########## | 100%  2025-05-07T19:44:00.7230437Z 2025-05-07T19:44:00.7230451Z 2025-05-07T19:44:00.7230462Z 2025-05-07T19:44:00.7230473Z 2025-05-07T19:44:00.7230484Z 2025-05-07T19:44:00.7230495Z 2025-05-07T19:44:00.7230505Z 2025-05-07T19:44:00.7230516Z 2025-05-07T19:44:00.7230563Z 2025-05-07T19:44:00.7231872Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:00.7233089Z 2025-05-07T19:44:00.7233101Z 2025-05-07T19:44:00.7233111Z 2025-05-07T19:44:00.7233160Z 2025-05-07T19:44:00.7233171Z 2025-05-07T19:44:00.7233182Z 2025-05-07T19:44:00.7233192Z 2025-05-07T19:44:00.7233202Z 2025-05-07T19:44:00.7233213Z 2025-05-07T19:44:00.7545923Z libgcc-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:00.7546251Z 2025-05-07T19:44:00.7546804Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:00.7547119Z 2025-05-07T19:44:00.7943115Z cryptography-44.0.3 | 1.5 MB | ########## | 100%  2025-05-07T19:44:00.7943974Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:00.7944887Z openssl-3.5.0 | 3.0 MB | ########## | 100% 2025-05-07T19:44:00.7945288Z 2025-05-07T19:44:00.7945510Z 2025-05-07T19:44:00.7945704Z  2025-05-07T19:44:00.7945949Z 2025-05-07T19:44:00.7945953Z 2025-05-07T19:44:00.7946165Z  2025-05-07T19:44:00.7947967Z 2025-05-07T19:44:00.7947972Z 2025-05-07T19:44:00.7947976Z 2025-05-07T19:44:00.7948180Z  2025-05-07T19:44:00.7948410Z 2025-05-07T19:44:00.7948414Z 2025-05-07T19:44:00.7948589Z 2025-05-07T19:44:00.7948593Z 2025-05-07T19:44:00.7948821Z  2025-05-07T19:44:00.7949059Z 2025-05-07T19:44:00.7949063Z 2025-05-07T19:44:00.7949066Z 2025-05-07T19:44:00.7949070Z 2025-05-07T19:44:00.7949073Z 2025-05-07T19:44:00.7949263Z  2025-05-07T19:44:00.7949524Z 2025-05-07T19:44:00.7949529Z 2025-05-07T19:44:00.7949532Z 2025-05-07T19:44:00.7949536Z 2025-05-07T19:44:00.7949539Z 2025-05-07T19:44:00.7949543Z 2025-05-07T19:44:00.7949738Z  2025-05-07T19:44:00.7950002Z 2025-05-07T19:44:00.7950006Z 2025-05-07T19:44:00.7950009Z 2025-05-07T19:44:00.7950018Z 2025-05-07T19:44:00.7950021Z 2025-05-07T19:44:00.7950025Z 2025-05-07T19:44:00.7950028Z 2025-05-07T19:44:00.7950253Z  2025-05-07T19:44:00.7950525Z 2025-05-07T19:44:00.7950529Z 2025-05-07T19:44:00.7950532Z 2025-05-07T19:44:00.7950536Z 2025-05-07T19:44:00.7950544Z 2025-05-07T19:44:00.7950547Z 2025-05-07T19:44:00.7950551Z 2025-05-07T19:44:00.7950555Z 2025-05-07T19:44:00.7950753Z  2025-05-07T19:44:00.7950999Z 2025-05-07T19:44:00.7951030Z 2025-05-07T19:44:00.7951033Z 2025-05-07T19:44:00.7951037Z 2025-05-07T19:44:00.7951040Z 2025-05-07T19:44:00.7951044Z 2025-05-07T19:44:00.7951047Z 2025-05-07T19:44:00.7951051Z 2025-05-07T19:44:00.7951054Z 2025-05-07T19:44:00.7951258Z  2025-05-07T19:44:00.7951510Z 2025-05-07T19:44:00.7951515Z 2025-05-07T19:44:00.7951547Z 2025-05-07T19:44:00.7951551Z 2025-05-07T19:44:00.7951555Z 2025-05-07T19:44:00.7951558Z 2025-05-07T19:44:00.7951562Z 2025-05-07T19:44:00.7951565Z 2025-05-07T19:44:00.7951569Z 2025-05-07T19:44:00.7951572Z 2025-05-07T19:44:00.7951794Z  done 2025-05-07T19:44:00.8957403Z Preparing transaction: - done 2025-05-07T19:44:00.9969481Z Verifying transaction: | done 2025-05-07T19:44:02.3999695Z Executing transaction: - \ | / - \ | / - \ | / - \ done 2025-05-07T19:44:02.4977532Z [SETUP] Testing pyOpenSSL import ... 2025-05-07T19:44:04.1869869Z [CHECK] Python (sub-)package 'OpenSSL' found ... 2025-05-07T19:44:04.1877576Z [SETUP] Installing libxcrypt ... 2025-05-07T19:44:04.1905274Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y libxcrypt 2025-05-07T19:44:04.8538651Z Channels: 2025-05-07T19:44:04.8539953Z - conda-forge 2025-05-07T19:44:04.8540721Z Platform: linux-64 2025-05-07T19:44:07.9187767Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:08.3421533Z Solving environment: \ done 2025-05-07T19:44:08.3903120Z 2025-05-07T19:44:08.3903693Z ## Package Plan ## 2025-05-07T19:44:08.3904168Z 2025-05-07T19:44:08.3905285Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:08.3906229Z 2025-05-07T19:44:08.3906516Z added / updated specs: 2025-05-07T19:44:08.3907261Z - libxcrypt 2025-05-07T19:44:08.3907648Z 2025-05-07T19:44:08.3907660Z 2025-05-07T19:44:08.3908042Z The following packages will be downloaded: 2025-05-07T19:44:08.3908548Z 2025-05-07T19:44:08.3908674Z package | build 2025-05-07T19:44:08.3909048Z ---------------------------|----------------- 2025-05-07T19:44:08.3909458Z libxcrypt-4.4.36 | hd590300_1 98 KB conda-forge 2025-05-07T19:44:08.3909922Z ------------------------------------------------------------ 2025-05-07T19:44:08.3911284Z Total: 98 KB 2025-05-07T19:44:08.3911544Z 2025-05-07T19:44:08.3911685Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:08.3911927Z 2025-05-07T19:44:08.3912191Z libxcrypt conda-forge/linux-64::libxcrypt-4.4.36-hd590300_1 2025-05-07T19:44:08.3912658Z 2025-05-07T19:44:08.3912662Z 2025-05-07T19:44:08.3912666Z 2025-05-07T19:44:08.3912815Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:08.5423666Z libxcrypt-4.4.36 | 98 KB | | 0% 2025-05-07T19:44:08.5455915Z libxcrypt-4.4.36 | 98 KB | #6 | 16% 2025-05-07T19:44:08.5573189Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:08.5574457Z libxcrypt-4.4.36 | 98 KB | ########## | 100% 2025-05-07T19:44:08.5575487Z 2025-05-07T19:44:08.5576359Z done 2025-05-07T19:44:08.6584418Z Preparing transaction: / done 2025-05-07T19:44:08.7594767Z Verifying transaction: \ done 2025-05-07T19:44:08.8601286Z Executing transaction: / done 2025-05-07T19:44:12.1503255Z [SETUP] Copying over ... 2025-05-07T19:44:12.1505457Z + cp /github/home/miniconda/envs/build_binary/include/crypt.h /github/home/miniconda/envs/build_binary/include/python3.10/crypt.h 2025-05-07T19:44:12.1507312Z 2025-05-07T19:44:12.1538380Z 2025-05-07T19:44:13.7523248Z [SETUP] Installed Python version: Python 3.10.16 2025-05-07T19:44:13.7524831Z [SETUP] Successfully created Conda environment: build_binary 2025-05-07T19:44:13.7582407Z ##[group]Run . $PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:13.7582921Z . $PRELUDE; install_cxx_compiler $BUILD_ENV clang 2025-05-07T19:44:13.7583491Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:44:13.7583836Z env: 2025-05-07T19:44:13.7584063Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:44:13.7584391Z BUILD_ENV: build_binary 2025-05-07T19:44:13.7584646Z BUILD_TARGET: default 2025-05-07T19:44:13.7584923Z BUILD_VARIANT: cuda 2025-05-07T19:44:13.7585163Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:44:13.7585429Z ##[endgroup] 2025-05-07T19:44:14.2104430Z ################################################################################ 2025-05-07T19:44:14.2105173Z # Install C/C++ Compilers 2025-05-07T19:44:14.2105489Z # 2025-05-07T19:44:14.2127222Z # [2025-05-07T19:44:14.211Z] + install_cxx_compiler build_binary clang 2025-05-07T19:44:14.2128656Z ################################################################################ 2025-05-07T19:44:14.2129353Z 2025-05-07T19:44:14.2143157Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:44:14.3073188Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:44:14.3077530Z [INSTALL] Installing GLIBC (architecture = 64) ... 2025-05-07T19:44:14.3101228Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y sysroot_linux-64=2.17 2025-05-07T19:44:14.9714034Z Channels: 2025-05-07T19:44:14.9714702Z - conda-forge 2025-05-07T19:44:14.9715328Z Platform: linux-64 2025-05-07T19:44:18.0670748Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:18.4962762Z Solving environment: \ | done 2025-05-07T19:44:18.5439234Z 2025-05-07T19:44:18.5439909Z ## Package Plan ## 2025-05-07T19:44:18.5440263Z 2025-05-07T19:44:18.5440496Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:18.5440986Z 2025-05-07T19:44:18.5441091Z added / updated specs: 2025-05-07T19:44:18.5441414Z - sysroot_linux-64=2.17 2025-05-07T19:44:18.5441590Z 2025-05-07T19:44:18.5441594Z 2025-05-07T19:44:18.5441723Z The following packages will be downloaded: 2025-05-07T19:44:18.5441971Z 2025-05-07T19:44:18.5442091Z package | build 2025-05-07T19:44:18.5442446Z ---------------------------|----------------- 2025-05-07T19:44:18.5442902Z kernel-headers_linux-64-3.10.0| he073ed8_18 921 KB conda-forge 2025-05-07T19:44:18.5443852Z sysroot_linux-64-2.17 | h0157908_18 14.5 MB conda-forge 2025-05-07T19:44:18.5444311Z ------------------------------------------------------------ 2025-05-07T19:44:18.5444724Z Total: 15.4 MB 2025-05-07T19:44:18.5444958Z 2025-05-07T19:44:18.5445129Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:18.5445376Z 2025-05-07T19:44:18.5445697Z kernel-headers_li~ conda-forge/noarch::kernel-headers_linux-64-3.10.0-he073ed8_18 2025-05-07T19:44:18.5446361Z sysroot_linux-64 conda-forge/noarch::sysroot_linux-64-2.17-h0157908_18 2025-05-07T19:44:18.5446923Z 2025-05-07T19:44:18.5446927Z 2025-05-07T19:44:18.5446930Z 2025-05-07T19:44:18.5447076Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:18.5447485Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:18.5447725Z 2025-05-07T19:44:18.7631791Z kernel-headers_linux | 921 KB | | 0%  2025-05-07T19:44:18.7632134Z 2025-05-07T19:44:18.7712882Z kernel-headers_linux | 921 KB | 1 | 2%  2025-05-07T19:44:18.7713738Z 2025-05-07T19:44:18.9136836Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:18.9137703Z 2025-05-07T19:44:18.9138420Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:18.9138728Z 2025-05-07T19:44:19.3639721Z kernel-headers_linux | 921 KB | ########## | 100%  2025-05-07T19:44:19.4962915Z sysroot_linux-64-2.1 | 14.5 MB | | 0% 2025-05-07T19:44:19.6747245Z sysroot_linux-64-2.1 | 14.5 MB | 1 | 1% 2025-05-07T19:44:19.8553303Z sysroot_linux-64-2.1 | 14.5 MB | 6 | 6% 2025-05-07T19:44:19.9712850Z sysroot_linux-64-2.1 | 14.5 MB | 9 | 10% 2025-05-07T19:44:20.1485483Z sysroot_linux-64-2.1 | 14.5 MB | #2 | 13% 2025-05-07T19:44:20.2870191Z sysroot_linux-64-2.1 | 14.5 MB | #6 | 17% 2025-05-07T19:44:20.5154163Z sysroot_linux-64-2.1 | 14.5 MB | ## | 20% 2025-05-07T19:44:20.6563038Z sysroot_linux-64-2.1 | 14.5 MB | ##4 | 24% 2025-05-07T19:44:20.7896858Z sysroot_linux-64-2.1 | 14.5 MB | ##8 | 28% 2025-05-07T19:44:20.9800041Z sysroot_linux-64-2.1 | 14.5 MB | ###1 | 32% 2025-05-07T19:44:21.1174020Z sysroot_linux-64-2.1 | 14.5 MB | ###5 | 35% 2025-05-07T19:44:21.2994468Z sysroot_linux-64-2.1 | 14.5 MB | ###9 | 39% 2025-05-07T19:44:21.4827404Z sysroot_linux-64-2.1 | 14.5 MB | ####2 | 43% 2025-05-07T19:44:21.5919799Z sysroot_linux-64-2.1 | 14.5 MB | ####6 | 47% 2025-05-07T19:44:21.7062982Z sysroot_linux-64-2.1 | 14.5 MB | ####9 | 49% 2025-05-07T19:44:21.8477090Z sysroot_linux-64-2.1 | 14.5 MB | #####1 | 52% 2025-05-07T19:44:22.0235409Z sysroot_linux-64-2.1 | 14.5 MB | #####4 | 54% 2025-05-07T19:44:22.1578997Z sysroot_linux-64-2.1 | 14.5 MB | ###### | 60% 2025-05-07T19:44:22.3356172Z sysroot_linux-64-2.1 | 14.5 MB | ######2 | 63% 2025-05-07T19:44:22.4706705Z sysroot_linux-64-2.1 | 14.5 MB | ######8 | 68% 2025-05-07T19:44:22.6021659Z sysroot_linux-64-2.1 | 14.5 MB | ####### | 71% 2025-05-07T19:44:22.7733652Z sysroot_linux-64-2.1 | 14.5 MB | #######5 | 76% 2025-05-07T19:44:22.8987199Z sysroot_linux-64-2.1 | 14.5 MB | #######9 | 79% 2025-05-07T19:44:23.0830974Z sysroot_linux-64-2.1 | 14.5 MB | ########1 | 82% 2025-05-07T19:44:23.2147657Z sysroot_linux-64-2.1 | 14.5 MB | ########5 | 85% 2025-05-07T19:44:23.3386126Z sysroot_linux-64-2.1 | 14.5 MB | ######### | 90% 2025-05-07T19:44:23.4756318Z sysroot_linux-64-2.1 | 14.5 MB | #########3 | 94% 2025-05-07T19:44:23.5988336Z sysroot_linux-64-2.1 | 14.5 MB | #########6 | 97% 2025-05-07T19:44:23.5988816Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:24.0401612Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:24.0404199Z sysroot_linux-64-2.1 | 14.5 MB | ########## | 100% 2025-05-07T19:44:24.0404867Z 2025-05-07T19:44:24.0405098Z 2025-05-07T19:44:24.0407533Z  done 2025-05-07T19:44:24.1415994Z Preparing transaction: - done 2025-05-07T19:44:24.3436823Z Verifying transaction: | / done 2025-05-07T19:44:24.4448597Z Executing transaction: \ done 2025-05-07T19:44:24.5299064Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:44:24.5299748Z [CHECK] CONDA_PREFIX is not set. 2025-05-07T19:44:26.1716328Z [CHECK] libstdc++.so.6 found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libstdc++.so.6 2025-05-07T19:44:26.1733789Z [INSTALL] Installing GCC (11.4.0, 64) through Conda ... 2025-05-07T19:44:26.1760045Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y gxx_linux-64=11.4.0 2025-05-07T19:44:26.8687374Z Channels: 2025-05-07T19:44:26.8688089Z - conda-forge 2025-05-07T19:44:26.8688779Z Platform: linux-64 2025-05-07T19:44:29.9120868Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:31.0398868Z Solving environment: \ | / done 2025-05-07T19:44:31.0914960Z 2025-05-07T19:44:31.0915518Z ## Package Plan ## 2025-05-07T19:44:31.0915994Z 2025-05-07T19:44:31.0918068Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:31.0918662Z 2025-05-07T19:44:31.0918825Z added / updated specs: 2025-05-07T19:44:31.0919252Z - gxx_linux-64=11.4.0 2025-05-07T19:44:31.0919520Z 2025-05-07T19:44:31.0919528Z 2025-05-07T19:44:31.0920098Z The following packages will be downloaded: 2025-05-07T19:44:31.0920532Z 2025-05-07T19:44:31.0920688Z package | build 2025-05-07T19:44:31.0921173Z ---------------------------|----------------- 2025-05-07T19:44:31.0921625Z binutils_impl_linux-64-2.40| ha1999f0_7 6.0 MB conda-forge 2025-05-07T19:44:31.0922241Z binutils_linux-64-2.40 | hb3c18ed_4 28 KB conda-forge 2025-05-07T19:44:31.0922918Z gcc_impl_linux-64-11.4.0 | h00c12a0_13 53.0 MB conda-forge 2025-05-07T19:44:31.0923387Z gcc_linux-64-11.4.0 | ha077dfb_4 31 KB conda-forge 2025-05-07T19:44:31.0923860Z gxx_impl_linux-64-11.4.0 | h634f3ee_13 11.2 MB conda-forge 2025-05-07T19:44:31.0924319Z gxx_linux-64-11.4.0 | h35bfe5d_4 29 KB conda-forge 2025-05-07T19:44:31.0924786Z ld_impl_linux-64-2.40 | hf3520f5_7 691 KB conda-forge 2025-05-07T19:44:31.0925279Z libgcc-devel_linux-64-11.4.0| h8f596e0_113 2.3 MB conda-forge 2025-05-07T19:44:31.0925925Z libsanitizer-11.4.0 | h5763a12_13 3.5 MB conda-forge 2025-05-07T19:44:31.0926402Z libstdcxx-15.1.0 | h8f9b012_2 3.7 MB conda-forge 2025-05-07T19:44:31.0926936Z libstdcxx-devel_linux-64-11.4.0| h8f596e0_113 11.1 MB conda-forge 2025-05-07T19:44:31.0927459Z libstdcxx-ng-15.1.0 | h4852527_2 34 KB conda-forge 2025-05-07T19:44:31.0927894Z ------------------------------------------------------------ 2025-05-07T19:44:31.0928439Z Total: 91.6 MB 2025-05-07T19:44:31.0928785Z 2025-05-07T19:44:31.0928985Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:31.0929361Z 2025-05-07T19:44:31.0929799Z binutils_impl_lin~ conda-forge/linux-64::binutils_impl_linux-64-2.40-ha1999f0_7 2025-05-07T19:44:31.0930831Z binutils_linux-64 conda-forge/linux-64::binutils_linux-64-2.40-hb3c18ed_4 2025-05-07T19:44:31.0931924Z gcc_impl_linux-64 conda-forge/linux-64::gcc_impl_linux-64-11.4.0-h00c12a0_13 2025-05-07T19:44:31.0932529Z gcc_linux-64 conda-forge/linux-64::gcc_linux-64-11.4.0-ha077dfb_4 2025-05-07T19:44:31.0933101Z gxx_impl_linux-64 conda-forge/linux-64::gxx_impl_linux-64-11.4.0-h634f3ee_13 2025-05-07T19:44:31.0933639Z gxx_linux-64 conda-forge/linux-64::gxx_linux-64-11.4.0-h35bfe5d_4 2025-05-07T19:44:31.0934548Z libgcc-devel_linu~ conda-forge/noarch::libgcc-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:31.0935561Z libsanitizer conda-forge/linux-64::libsanitizer-11.4.0-h5763a12_13 2025-05-07T19:44:31.0936466Z libstdcxx conda-forge/linux-64::libstdcxx-15.1.0-h8f9b012_2 2025-05-07T19:44:31.0937580Z libstdcxx-devel_l~ conda-forge/noarch::libstdcxx-devel_linux-64-11.4.0-h8f596e0_113 2025-05-07T19:44:31.0938319Z 2025-05-07T19:44:31.0938528Z The following packages will be UPDATED: 2025-05-07T19:44:31.0938878Z 2025-05-07T19:44:31.0939498Z ld_impl_linux-64 pkgs/main::ld_impl_linux-64-2.40-h12e~ --> conda-forge::ld_impl_linux-64-2.40-hf3520f5_7 2025-05-07T19:44:31.0940958Z libstdcxx-ng pkgs/main::libstdcxx-ng-11.2.0-h12345~ --> conda-forge::libstdcxx-ng-15.1.0-h4852527_2 2025-05-07T19:44:31.0941836Z 2025-05-07T19:44:31.0941844Z 2025-05-07T19:44:31.0941851Z 2025-05-07T19:44:31.0942126Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:31.0942769Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:31.0943188Z 2025-05-07T19:44:31.0943702Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:31.0943986Z 2025-05-07T19:44:31.0943989Z 2025-05-07T19:44:31.0944225Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:31.0944501Z 2025-05-07T19:44:31.0944505Z 2025-05-07T19:44:31.0944509Z 2025-05-07T19:44:31.0951722Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:31.0952183Z 2025-05-07T19:44:31.0952188Z 2025-05-07T19:44:31.0952404Z 2025-05-07T19:44:31.0952409Z 2025-05-07T19:44:31.0965063Z libstdcxx-15.1.0 | 3.7 MB | | 0%  2025-05-07T19:44:31.0966024Z 2025-05-07T19:44:31.0966040Z 2025-05-07T19:44:31.0966052Z 2025-05-07T19:44:31.0966065Z 2025-05-07T19:44:31.0966078Z 2025-05-07T19:44:31.0966979Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:31.0967325Z 2025-05-07T19:44:31.0967333Z 2025-05-07T19:44:31.0967374Z 2025-05-07T19:44:31.0967377Z 2025-05-07T19:44:31.0967383Z 2025-05-07T19:44:31.0967388Z 2025-05-07T19:44:31.0967672Z libgcc-devel_linux-6 | 2.3 MB | | 0%  2025-05-07T19:44:31.0968019Z 2025-05-07T19:44:31.0968022Z 2025-05-07T19:44:31.0968028Z 2025-05-07T19:44:31.0968031Z 2025-05-07T19:44:31.0968036Z 2025-05-07T19:44:31.0968040Z 2025-05-07T19:44:31.0968044Z 2025-05-07T19:44:31.0968308Z ld_impl_linux-64-2.4 | 691 KB | | 0%  2025-05-07T19:44:31.0968616Z 2025-05-07T19:44:31.0968653Z 2025-05-07T19:44:31.0968681Z 2025-05-07T19:44:31.0968684Z 2025-05-07T19:44:31.0968688Z 2025-05-07T19:44:31.0968691Z 2025-05-07T19:44:31.0968695Z 2025-05-07T19:44:31.0968698Z 2025-05-07T19:44:31.0968998Z libstdcxx-ng-15.1.0 | 34 KB | | 0%  2025-05-07T19:44:31.0969312Z 2025-05-07T19:44:31.0969341Z 2025-05-07T19:44:31.0969344Z 2025-05-07T19:44:31.0969348Z 2025-05-07T19:44:31.0969358Z 2025-05-07T19:44:31.0969361Z 2025-05-07T19:44:31.0969365Z 2025-05-07T19:44:31.0969368Z 2025-05-07T19:44:31.0969372Z 2025-05-07T19:44:31.0969639Z gcc_linux-64-11.4.0 | 31 KB | | 0%  2025-05-07T19:44:31.0969942Z 2025-05-07T19:44:31.0969946Z 2025-05-07T19:44:31.0969973Z 2025-05-07T19:44:31.0969977Z 2025-05-07T19:44:31.0969980Z 2025-05-07T19:44:31.0969984Z 2025-05-07T19:44:31.0969987Z 2025-05-07T19:44:31.0969991Z 2025-05-07T19:44:31.0969994Z 2025-05-07T19:44:31.0969998Z 2025-05-07T19:44:31.0970282Z gxx_linux-64-11.4.0 | 29 KB | | 0%  2025-05-07T19:44:31.0970589Z 2025-05-07T19:44:31.0970618Z 2025-05-07T19:44:31.0970621Z 2025-05-07T19:44:31.0970625Z 2025-05-07T19:44:31.0970629Z 2025-05-07T19:44:31.0970632Z 2025-05-07T19:44:31.0970636Z 2025-05-07T19:44:31.0970639Z 2025-05-07T19:44:31.0970643Z 2025-05-07T19:44:31.0970647Z 2025-05-07T19:44:31.0970650Z 2025-05-07T19:44:31.2584597Z binutils_linux-64-2. | 28 KB | | 0%  2025-05-07T19:44:31.2586118Z 2025-05-07T19:44:31.2586131Z 2025-05-07T19:44:31.2586143Z 2025-05-07T19:44:31.2586154Z 2025-05-07T19:44:31.3892332Z libstdcxx-15.1.0 | 3.7 MB | | 0%  2025-05-07T19:44:31.3893274Z 2025-05-07T19:44:31.3893289Z 2025-05-07T19:44:31.3893301Z 2025-05-07T19:44:31.3893312Z 2025-05-07T19:44:31.4312949Z libstdcxx-15.1.0 | 3.7 MB | 2 | 2%  2025-05-07T19:44:31.4313506Z 2025-05-07T19:44:31.4313511Z 2025-05-07T19:44:31.4313542Z 2025-05-07T19:44:31.4313546Z 2025-05-07T19:44:31.4415417Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:31.4416333Z 2025-05-07T19:44:31.4416384Z 2025-05-07T19:44:31.4608609Z libstdcxx-devel_linu | 11.1 MB | | 0%  2025-05-07T19:44:31.4609512Z 2025-05-07T19:44:31.4609526Z 2025-05-07T19:44:31.4609537Z 2025-05-07T19:44:31.4686671Z binutils_impl_linux- | 6.0 MB | | 0%  2025-05-07T19:44:31.4687620Z 2025-05-07T19:44:31.4734906Z gxx_impl_linux-64-11 | 11.2 MB | | 0%  2025-05-07T19:44:31.4735789Z 2025-05-07T19:44:31.4735802Z 2025-05-07T19:44:31.4735812Z 2025-05-07T19:44:31.4735823Z 2025-05-07T19:44:31.4735834Z 2025-05-07T19:44:31.4805273Z libsanitizer-11.4.0 | 3.5 MB | | 0%  2025-05-07T19:44:31.5415914Z gcc_impl_linux-64-11 | 53.0 MB | | 0% 2025-05-07T19:44:31.5416708Z 2025-05-07T19:44:31.5416722Z 2025-05-07T19:44:31.5470308Z libstdcxx-devel_linu | 11.1 MB | ########8 | 89%  2025-05-07T19:44:31.5471281Z 2025-05-07T19:44:31.5471295Z 2025-05-07T19:44:31.5471307Z 2025-05-07T19:44:31.5471317Z 2025-05-07T19:44:31.5472078Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:31.5473215Z 2025-05-07T19:44:31.5473227Z 2025-05-07T19:44:31.5473238Z 2025-05-07T19:44:31.5473250Z 2025-05-07T19:44:31.5573791Z libstdcxx-15.1.0 | 3.7 MB | ########## | 100%  2025-05-07T19:44:31.5574745Z 2025-05-07T19:44:31.5574758Z 2025-05-07T19:44:31.5574801Z 2025-05-07T19:44:31.5574812Z 2025-05-07T19:44:31.5574823Z 2025-05-07T19:44:31.5690693Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:31.5691048Z 2025-05-07T19:44:31.5805848Z gxx_impl_linux-64-11 | 11.2 MB | #######7 | 77%  2025-05-07T19:44:31.5871881Z gcc_impl_linux-64-11 | 53.0 MB | 9 | 9% 2025-05-07T19:44:31.5872922Z 2025-05-07T19:44:31.5872935Z 2025-05-07T19:44:31.5872981Z 2025-05-07T19:44:31.5873950Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:31.5874799Z 2025-05-07T19:44:31.5874811Z 2025-05-07T19:44:31.5874822Z 2025-05-07T19:44:31.6039127Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:31.6039467Z 2025-05-07T19:44:31.6039472Z 2025-05-07T19:44:31.6039475Z 2025-05-07T19:44:31.6039479Z 2025-05-07T19:44:31.6039483Z 2025-05-07T19:44:31.6039486Z 2025-05-07T19:44:31.6381175Z libgcc-devel_linux-6 | 2.3 MB | | 1%  2025-05-07T19:44:31.6381552Z 2025-05-07T19:44:31.6381557Z 2025-05-07T19:44:31.6381561Z 2025-05-07T19:44:31.6381566Z 2025-05-07T19:44:31.6381570Z 2025-05-07T19:44:31.6381573Z 2025-05-07T19:44:31.6381578Z 2025-05-07T19:44:31.6500058Z ld_impl_linux-64-2.4 | 691 KB | 2 | 2%  2025-05-07T19:44:31.6500431Z 2025-05-07T19:44:31.6500435Z 2025-05-07T19:44:31.6500439Z 2025-05-07T19:44:31.6500442Z 2025-05-07T19:44:31.6500446Z 2025-05-07T19:44:31.6500486Z 2025-05-07T19:44:31.6500489Z 2025-05-07T19:44:31.6518403Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:31.6518714Z 2025-05-07T19:44:31.6518717Z 2025-05-07T19:44:31.6518722Z 2025-05-07T19:44:31.6518725Z 2025-05-07T19:44:31.6518729Z 2025-05-07T19:44:31.6518742Z 2025-05-07T19:44:31.6684130Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:31.6684752Z 2025-05-07T19:44:31.6684757Z 2025-05-07T19:44:31.6894477Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:31.6894839Z 2025-05-07T19:44:31.6894843Z 2025-05-07T19:44:31.6894848Z 2025-05-07T19:44:31.6894852Z 2025-05-07T19:44:31.6894856Z 2025-05-07T19:44:31.6894860Z 2025-05-07T19:44:31.6894864Z 2025-05-07T19:44:31.6900095Z ld_impl_linux-64-2.4 | 691 KB | ########## | 100%  2025-05-07T19:44:31.7032876Z gcc_impl_linux-64-11 | 53.0 MB | #9 | 20% 2025-05-07T19:44:31.7033217Z 2025-05-07T19:44:31.7033244Z 2025-05-07T19:44:31.7033249Z 2025-05-07T19:44:31.7033253Z 2025-05-07T19:44:31.7033258Z 2025-05-07T19:44:31.7033263Z 2025-05-07T19:44:31.7033268Z 2025-05-07T19:44:31.7033272Z 2025-05-07T19:44:31.7051317Z libstdcxx-ng-15.1.0 | 34 KB | ####7 | 47%  2025-05-07T19:44:31.7052329Z 2025-05-07T19:44:31.7052342Z 2025-05-07T19:44:31.7052353Z 2025-05-07T19:44:31.7052364Z 2025-05-07T19:44:31.7052374Z 2025-05-07T19:44:31.7052419Z 2025-05-07T19:44:31.7052431Z 2025-05-07T19:44:31.7052441Z 2025-05-07T19:44:31.7111273Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:31.7112376Z 2025-05-07T19:44:31.7112380Z 2025-05-07T19:44:31.7112384Z 2025-05-07T19:44:31.7112387Z 2025-05-07T19:44:31.7112391Z 2025-05-07T19:44:31.7112394Z 2025-05-07T19:44:31.7112398Z 2025-05-07T19:44:31.7112401Z 2025-05-07T19:44:31.7112405Z 2025-05-07T19:44:31.7118838Z gcc_linux-64-11.4.0 | 31 KB | #####2 | 52%  2025-05-07T19:44:31.7119154Z 2025-05-07T19:44:31.7119158Z 2025-05-07T19:44:31.7119162Z 2025-05-07T19:44:31.7119165Z 2025-05-07T19:44:31.7119178Z 2025-05-07T19:44:31.7119181Z 2025-05-07T19:44:31.7119185Z 2025-05-07T19:44:31.7119189Z 2025-05-07T19:44:31.7119193Z 2025-05-07T19:44:31.7296588Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:31.7296933Z 2025-05-07T19:44:31.7296937Z 2025-05-07T19:44:31.7296960Z 2025-05-07T19:44:31.7296964Z 2025-05-07T19:44:31.7296967Z 2025-05-07T19:44:31.7296971Z 2025-05-07T19:44:31.7296975Z 2025-05-07T19:44:31.7296978Z 2025-05-07T19:44:31.7296982Z 2025-05-07T19:44:31.7296985Z 2025-05-07T19:44:31.7305346Z gxx_linux-64-11.4.0 | 29 KB | #####5 | 55%  2025-05-07T19:44:31.7305671Z 2025-05-07T19:44:31.7305675Z 2025-05-07T19:44:31.7305688Z 2025-05-07T19:44:31.7305693Z 2025-05-07T19:44:31.7305696Z 2025-05-07T19:44:31.7305699Z 2025-05-07T19:44:31.7305703Z 2025-05-07T19:44:31.7305717Z 2025-05-07T19:44:31.7305721Z 2025-05-07T19:44:31.7305724Z 2025-05-07T19:44:31.7335369Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:31.7335710Z 2025-05-07T19:44:31.7385833Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:31.7386160Z 2025-05-07T19:44:31.7386165Z 2025-05-07T19:44:31.7386168Z 2025-05-07T19:44:31.7386172Z 2025-05-07T19:44:31.7386175Z 2025-05-07T19:44:31.7387012Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:31.7387360Z 2025-05-07T19:44:31.7387365Z 2025-05-07T19:44:31.7387369Z 2025-05-07T19:44:31.7387372Z 2025-05-07T19:44:31.7387398Z 2025-05-07T19:44:31.7578883Z libsanitizer-11.4.0 | 3.5 MB | ########## | 100%  2025-05-07T19:44:31.7579247Z 2025-05-07T19:44:31.7579252Z 2025-05-07T19:44:31.7579256Z 2025-05-07T19:44:31.7579260Z 2025-05-07T19:44:31.7579264Z 2025-05-07T19:44:31.7579268Z 2025-05-07T19:44:31.7579271Z 2025-05-07T19:44:31.7579318Z 2025-05-07T19:44:31.7579321Z 2025-05-07T19:44:31.7579325Z 2025-05-07T19:44:31.7579329Z 2025-05-07T19:44:31.7587719Z binutils_linux-64-2. | 28 KB | #####6 | 56%  2025-05-07T19:44:31.7588061Z 2025-05-07T19:44:31.7588074Z 2025-05-07T19:44:31.7588078Z 2025-05-07T19:44:31.7588104Z 2025-05-07T19:44:31.7588107Z 2025-05-07T19:44:31.7588111Z 2025-05-07T19:44:31.7588114Z 2025-05-07T19:44:31.7588312Z 2025-05-07T19:44:31.7588316Z 2025-05-07T19:44:31.7588319Z 2025-05-07T19:44:31.7588322Z 2025-05-07T19:44:31.7921829Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:31.7992439Z gcc_impl_linux-64-11 | 53.0 MB | ###6 | 36% 2025-05-07T19:44:31.7992902Z 2025-05-07T19:44:31.7993026Z 2025-05-07T19:44:31.7993031Z 2025-05-07T19:44:31.7993059Z 2025-05-07T19:44:31.7993083Z 2025-05-07T19:44:31.7993088Z 2025-05-07T19:44:31.7993611Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:31.7993934Z 2025-05-07T19:44:31.7993952Z 2025-05-07T19:44:31.7993955Z 2025-05-07T19:44:31.7993959Z 2025-05-07T19:44:31.7993962Z 2025-05-07T19:44:31.7993966Z 2025-05-07T19:44:31.8355119Z libgcc-devel_linux-6 | 2.3 MB | ########## | 100%  2025-05-07T19:44:31.8355469Z 2025-05-07T19:44:31.8355489Z 2025-05-07T19:44:31.8355493Z 2025-05-07T19:44:31.8355497Z 2025-05-07T19:44:31.8355501Z 2025-05-07T19:44:31.8355529Z 2025-05-07T19:44:31.8355533Z 2025-05-07T19:44:31.8355537Z 2025-05-07T19:44:31.8355849Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:31.8356162Z 2025-05-07T19:44:31.8356167Z 2025-05-07T19:44:31.8356170Z 2025-05-07T19:44:31.8356173Z 2025-05-07T19:44:31.8356177Z 2025-05-07T19:44:31.8356181Z 2025-05-07T19:44:31.8356185Z 2025-05-07T19:44:31.8356189Z 2025-05-07T19:44:31.8674181Z libstdcxx-ng-15.1.0 | 34 KB | ########## | 100%  2025-05-07T19:44:31.8674536Z 2025-05-07T19:44:31.8674770Z 2025-05-07T19:44:31.8674775Z 2025-05-07T19:44:31.8674779Z 2025-05-07T19:44:31.8674782Z 2025-05-07T19:44:31.8674786Z 2025-05-07T19:44:31.8674789Z 2025-05-07T19:44:31.8674793Z 2025-05-07T19:44:31.8674796Z 2025-05-07T19:44:31.8675115Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:31.8675423Z 2025-05-07T19:44:31.8675427Z 2025-05-07T19:44:31.8675430Z 2025-05-07T19:44:31.8675441Z 2025-05-07T19:44:31.8675444Z 2025-05-07T19:44:31.8675448Z 2025-05-07T19:44:31.8675451Z 2025-05-07T19:44:31.8675455Z 2025-05-07T19:44:31.8675458Z 2025-05-07T19:44:31.8989428Z gcc_linux-64-11.4.0 | 31 KB | ########## | 100%  2025-05-07T19:44:31.8989771Z 2025-05-07T19:44:31.8989776Z 2025-05-07T19:44:31.8989779Z 2025-05-07T19:44:31.8989783Z 2025-05-07T19:44:31.8989786Z 2025-05-07T19:44:31.8989790Z 2025-05-07T19:44:31.8989793Z 2025-05-07T19:44:31.8989797Z 2025-05-07T19:44:31.8989800Z 2025-05-07T19:44:31.8989804Z 2025-05-07T19:44:31.8990125Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:31.8990433Z 2025-05-07T19:44:31.8990437Z 2025-05-07T19:44:31.8990440Z 2025-05-07T19:44:31.8990444Z 2025-05-07T19:44:31.8990447Z 2025-05-07T19:44:31.8990450Z 2025-05-07T19:44:31.8990454Z 2025-05-07T19:44:31.8990457Z 2025-05-07T19:44:31.8990461Z 2025-05-07T19:44:31.8990465Z 2025-05-07T19:44:31.9637500Z gxx_linux-64-11.4.0 | 29 KB | ########## | 100%  2025-05-07T19:44:32.0241176Z gcc_impl_linux-64-11 | 53.0 MB | ####6 | 47% 2025-05-07T19:44:32.0241472Z 2025-05-07T19:44:32.0241478Z 2025-05-07T19:44:32.0241483Z 2025-05-07T19:44:32.0478263Z binutils_impl_linux- | 6.0 MB | ########## | 100%  2025-05-07T19:44:32.0478605Z 2025-05-07T19:44:32.0478611Z 2025-05-07T19:44:32.0478616Z 2025-05-07T19:44:32.0478621Z 2025-05-07T19:44:32.0478625Z 2025-05-07T19:44:32.0478630Z 2025-05-07T19:44:32.0478635Z 2025-05-07T19:44:32.0478668Z 2025-05-07T19:44:32.0478672Z 2025-05-07T19:44:32.0478676Z 2025-05-07T19:44:32.0478706Z 2025-05-07T19:44:32.0481103Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:32.0481435Z 2025-05-07T19:44:32.0481450Z 2025-05-07T19:44:32.0481454Z 2025-05-07T19:44:32.0481458Z 2025-05-07T19:44:32.0481462Z 2025-05-07T19:44:32.0481465Z 2025-05-07T19:44:32.0481488Z 2025-05-07T19:44:32.0481492Z 2025-05-07T19:44:32.0481720Z 2025-05-07T19:44:32.0481724Z 2025-05-07T19:44:32.0481727Z 2025-05-07T19:44:32.0641865Z binutils_linux-64-2. | 28 KB | ########## | 100%  2025-05-07T19:44:32.1427844Z gcc_impl_linux-64-11 | 53.0 MB | ######5 | 65% 2025-05-07T19:44:32.1428184Z 2025-05-07T19:44:32.1655447Z gxx_impl_linux-64-11 | 11.2 MB | ########## | 100%  2025-05-07T19:44:32.2683435Z gcc_impl_linux-64-11 | 53.0 MB | #######9 | 80% 2025-05-07T19:44:32.2733946Z gcc_impl_linux-64-11 | 53.0 MB | #########7 | 97% 2025-05-07T19:44:32.2734774Z 2025-05-07T19:44:32.2734790Z 2025-05-07T19:44:32.4156840Z libstdcxx-devel_linu | 11.1 MB | ########## | 100%  2025-05-07T19:44:32.9444706Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:32.9458008Z gcc_impl_linux-64-11 | 53.0 MB | ########## | 100% 2025-05-07T19:44:32.9459730Z 2025-05-07T19:44:32.9460580Z 2025-05-07T19:44:32.9461466Z  2025-05-07T19:44:32.9462187Z 2025-05-07T19:44:32.9462199Z 2025-05-07T19:44:32.9462863Z  2025-05-07T19:44:32.9463663Z 2025-05-07T19:44:32.9463676Z 2025-05-07T19:44:32.9463688Z 2025-05-07T19:44:32.9464219Z  2025-05-07T19:44:32.9464905Z 2025-05-07T19:44:32.9464917Z 2025-05-07T19:44:32.9464929Z 2025-05-07T19:44:32.9464941Z 2025-05-07T19:44:32.9465913Z  2025-05-07T19:44:32.9466584Z 2025-05-07T19:44:32.9466595Z 2025-05-07T19:44:32.9466606Z 2025-05-07T19:44:32.9466617Z 2025-05-07T19:44:32.9466670Z 2025-05-07T19:44:32.9467206Z  2025-05-07T19:44:32.9467930Z 2025-05-07T19:44:32.9467941Z 2025-05-07T19:44:32.9467951Z 2025-05-07T19:44:32.9467961Z 2025-05-07T19:44:32.9467972Z 2025-05-07T19:44:32.9468035Z 2025-05-07T19:44:32.9468541Z  2025-05-07T19:44:32.9468882Z 2025-05-07T19:44:32.9468885Z 2025-05-07T19:44:32.9468888Z 2025-05-07T19:44:32.9468891Z 2025-05-07T19:44:32.9468894Z 2025-05-07T19:44:32.9468898Z 2025-05-07T19:44:32.9468901Z 2025-05-07T19:44:32.9469117Z  2025-05-07T19:44:32.9469349Z 2025-05-07T19:44:32.9469352Z 2025-05-07T19:44:32.9469355Z 2025-05-07T19:44:32.9469358Z 2025-05-07T19:44:32.9469369Z 2025-05-07T19:44:32.9469373Z 2025-05-07T19:44:32.9469376Z 2025-05-07T19:44:32.9469379Z 2025-05-07T19:44:32.9469594Z  2025-05-07T19:44:32.9469822Z 2025-05-07T19:44:32.9469825Z 2025-05-07T19:44:32.9469828Z 2025-05-07T19:44:32.9469832Z 2025-05-07T19:44:32.9469836Z 2025-05-07T19:44:32.9469839Z 2025-05-07T19:44:32.9469842Z 2025-05-07T19:44:32.9469845Z 2025-05-07T19:44:32.9469854Z 2025-05-07T19:44:32.9470078Z  2025-05-07T19:44:32.9470309Z 2025-05-07T19:44:32.9470313Z 2025-05-07T19:44:32.9470316Z 2025-05-07T19:44:32.9470319Z 2025-05-07T19:44:32.9470322Z 2025-05-07T19:44:32.9470326Z 2025-05-07T19:44:32.9470329Z 2025-05-07T19:44:32.9470332Z 2025-05-07T19:44:32.9470335Z 2025-05-07T19:44:32.9470338Z 2025-05-07T19:44:32.9470566Z  2025-05-07T19:44:32.9470811Z 2025-05-07T19:44:32.9470814Z 2025-05-07T19:44:32.9470817Z 2025-05-07T19:44:32.9470821Z 2025-05-07T19:44:32.9470824Z 2025-05-07T19:44:32.9470827Z 2025-05-07T19:44:32.9470830Z 2025-05-07T19:44:32.9470834Z 2025-05-07T19:44:32.9470837Z 2025-05-07T19:44:32.9470840Z 2025-05-07T19:44:32.9470843Z 2025-05-07T19:44:32.9471072Z  done 2025-05-07T19:44:33.0471029Z Preparing transaction: \ done 2025-05-07T19:44:33.2476940Z Verifying transaction: / - done 2025-05-07T19:44:33.3493752Z Executing transaction: | done 2025-05-07T19:44:33.4417062Z [INSTALL] Setting the C/C++ compiler symlinks ... 2025-05-07T19:44:37.1777043Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:37.1778896Z 2025-05-07T19:44:37.1788324Z 2025-05-07T19:44:37.1804563Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:37.1806403Z 2025-05-07T19:44:37.1817681Z 2025-05-07T19:44:37.1833999Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:37.1834621Z 2025-05-07T19:44:37.1850057Z 2025-05-07T19:44:37.1865790Z + ln -sf /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:37.1866421Z 2025-05-07T19:44:37.1874280Z 2025-05-07T19:44:37.1882987Z [INSTALL] Installing Clang (16.0.6, 64) and relevant libraries through Conda ... 2025-05-07T19:44:37.1906896Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y clangxx=16.0.6 libcxx llvm-openmp=16.0.6 compiler-rt=16.0.6 2025-05-07T19:44:37.8995043Z Channels: 2025-05-07T19:44:37.8995716Z - conda-forge 2025-05-07T19:44:37.8996390Z Platform: linux-64 2025-05-07T19:44:40.9659660Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:44:42.3134597Z Solving environment: \ | / - done 2025-05-07T19:44:42.3678218Z 2025-05-07T19:44:42.3678833Z ## Package Plan ## 2025-05-07T19:44:42.3679304Z 2025-05-07T19:44:42.3679969Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:44:42.3680706Z 2025-05-07T19:44:42.3680822Z added / updated specs: 2025-05-07T19:44:42.3681104Z - clangxx=16.0.6 2025-05-07T19:44:42.3681425Z - compiler-rt=16.0.6 2025-05-07T19:44:42.3681708Z - libcxx 2025-05-07T19:44:42.3681942Z - llvm-openmp=16.0.6 2025-05-07T19:44:42.3682110Z 2025-05-07T19:44:42.3682114Z 2025-05-07T19:44:42.3682274Z The following packages will be downloaded: 2025-05-07T19:44:42.3682511Z 2025-05-07T19:44:42.3682637Z package | build 2025-05-07T19:44:42.3683127Z ---------------------------|----------------- 2025-05-07T19:44:42.3683534Z clang-16.0.6 |default_h9e3a008_14 110 KB conda-forge 2025-05-07T19:44:42.3684049Z clang-16-16.0.6 |default_hb5137d0_14 780 KB conda-forge 2025-05-07T19:44:42.3684662Z clangxx-16.0.6 |default_ha78316a_14 110 KB conda-forge 2025-05-07T19:44:42.3685115Z compiler-rt-16.0.6 | h00ab1b0_2 107 KB conda-forge 2025-05-07T19:44:42.3685806Z compiler-rt_linux-64-16.0.6| h00ab1b0_2 36.0 MB conda-forge 2025-05-07T19:44:42.3686272Z icu-73.2 | h59595ed_0 11.5 MB conda-forge 2025-05-07T19:44:42.3686774Z libclang-cpp16-16.0.6 |default_hb5137d0_14 17.3 MB conda-forge 2025-05-07T19:44:42.3687265Z libcxx-19.1.7 | h2713693_1 1000 KB conda-forge 2025-05-07T19:44:42.3687748Z libcxxabi-19.1.7 | hd85fd95_1 158 KB conda-forge 2025-05-07T19:44:42.3688243Z libiconv-1.18 | h4ce23a2_1 696 KB conda-forge 2025-05-07T19:44:42.3688706Z libllvm16-16.0.6 | hb3ce162_3 33.7 MB conda-forge 2025-05-07T19:44:42.3689182Z libxml2-2.12.7 | hc051c1a_1 688 KB conda-forge 2025-05-07T19:44:42.3689623Z libzlib-1.2.13 | h4ab18f5_6 60 KB conda-forge 2025-05-07T19:44:42.3690114Z llvm-openmp-16.0.6 | h4dfa4b3_0 39.9 MB conda-forge 2025-05-07T19:44:42.3690586Z zlib-1.2.13 | h4ab18f5_6 91 KB conda-forge 2025-05-07T19:44:42.3691318Z zstd-1.5.6 | ha6fb4c9_0 542 KB conda-forge 2025-05-07T19:44:42.3691733Z ------------------------------------------------------------ 2025-05-07T19:44:42.3692144Z Total: 142.6 MB 2025-05-07T19:44:42.3692375Z 2025-05-07T19:44:42.3692518Z The following NEW packages will be INSTALLED: 2025-05-07T19:44:42.3692787Z 2025-05-07T19:44:42.3693038Z clang conda-forge/linux-64::clang-16.0.6-default_h9e3a008_14 2025-05-07T19:44:42.3693596Z clang-16 conda-forge/linux-64::clang-16-16.0.6-default_hb5137d0_14 2025-05-07T19:44:42.3694131Z clangxx conda-forge/linux-64::clangxx-16.0.6-default_ha78316a_14 2025-05-07T19:44:42.3694718Z compiler-rt conda-forge/linux-64::compiler-rt-16.0.6-h00ab1b0_2 2025-05-07T19:44:42.3695325Z compiler-rt_linux~ conda-forge/noarch::compiler-rt_linux-64-16.0.6-h00ab1b0_2 2025-05-07T19:44:42.3695848Z icu conda-forge/linux-64::icu-73.2-h59595ed_0 2025-05-07T19:44:42.3696418Z libclang-cpp16 conda-forge/linux-64::libclang-cpp16-16.0.6-default_hb5137d0_14 2025-05-07T19:44:42.3697096Z libcxx conda-forge/linux-64::libcxx-19.1.7-h2713693_1 2025-05-07T19:44:42.3697593Z libcxxabi conda-forge/linux-64::libcxxabi-19.1.7-hd85fd95_1 2025-05-07T19:44:42.3698068Z libiconv conda-forge/linux-64::libiconv-1.18-h4ce23a2_1 2025-05-07T19:44:42.3698570Z libllvm16 conda-forge/linux-64::libllvm16-16.0.6-hb3ce162_3 2025-05-07T19:44:42.3699381Z libxml2 conda-forge/linux-64::libxml2-2.12.7-hc051c1a_1 2025-05-07T19:44:42.3699858Z libzlib conda-forge/linux-64::libzlib-1.2.13-h4ab18f5_6 2025-05-07T19:44:42.3700402Z llvm-openmp conda-forge/linux-64::llvm-openmp-16.0.6-h4dfa4b3_0 2025-05-07T19:44:42.3703442Z zstd conda-forge/linux-64::zstd-1.5.6-ha6fb4c9_0 2025-05-07T19:44:42.3703753Z 2025-05-07T19:44:42.3703888Z The following packages will be UPDATED: 2025-05-07T19:44:42.3704132Z 2025-05-07T19:44:42.3704433Z zlib pkgs/main::zlib-1.2.13-h5eee18b_1 --> conda-forge::zlib-1.2.13-h4ab18f5_6 2025-05-07T19:44:42.3704808Z 2025-05-07T19:44:42.3704812Z 2025-05-07T19:44:42.3704816Z 2025-05-07T19:44:42.3704982Z Downloading and Extracting Packages: ...working... 2025-05-07T19:44:42.3705445Z llvm-openmp-16.0.6 | 39.9 MB | | 0% 2025-05-07T19:44:42.3705713Z 2025-05-07T19:44:42.3706075Z compiler-rt_linux-64 | 36.0 MB | | 0%  2025-05-07T19:44:42.3706355Z 2025-05-07T19:44:42.3706359Z 2025-05-07T19:44:42.3706594Z libllvm16-16.0.6 | 33.7 MB | | 0%  2025-05-07T19:44:42.3707018Z 2025-05-07T19:44:42.3707021Z 2025-05-07T19:44:42.3707025Z 2025-05-07T19:44:42.3707405Z libclang-cpp16-16.0. | 17.3 MB | | 0%  2025-05-07T19:44:42.3707677Z 2025-05-07T19:44:42.3707681Z 2025-05-07T19:44:42.3707685Z 2025-05-07T19:44:42.3707688Z 2025-05-07T19:44:42.3713237Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:42.3713497Z 2025-05-07T19:44:42.3713501Z 2025-05-07T19:44:42.3713504Z 2025-05-07T19:44:42.3713508Z 2025-05-07T19:44:42.3713511Z 2025-05-07T19:44:42.3714281Z libcxx-19.1.7 | 1000 KB | | 0%  2025-05-07T19:44:42.3714576Z 2025-05-07T19:44:42.3714579Z 2025-05-07T19:44:42.3714583Z 2025-05-07T19:44:42.3714586Z 2025-05-07T19:44:42.3714590Z 2025-05-07T19:44:42.3714594Z 2025-05-07T19:44:42.3717822Z clang-16-16.0.6 | 780 KB | | 0%  2025-05-07T19:44:42.3718138Z 2025-05-07T19:44:42.3718154Z 2025-05-07T19:44:42.3718158Z 2025-05-07T19:44:42.3718161Z 2025-05-07T19:44:42.3718165Z 2025-05-07T19:44:42.3718169Z 2025-05-07T19:44:42.3720448Z 2025-05-07T19:44:42.3722193Z libiconv-1.18 | 696 KB | | 0%  2025-05-07T19:44:42.3723054Z 2025-05-07T19:44:42.3723092Z 2025-05-07T19:44:42.3723103Z 2025-05-07T19:44:42.3723326Z 2025-05-07T19:44:42.3723338Z 2025-05-07T19:44:42.3723348Z 2025-05-07T19:44:42.3723358Z 2025-05-07T19:44:42.3723369Z 2025-05-07T19:44:42.3724091Z libxml2-2.12.7 | 688 KB | | 0%  2025-05-07T19:44:42.3724953Z 2025-05-07T19:44:42.3724964Z 2025-05-07T19:44:42.3724974Z 2025-05-07T19:44:42.3724985Z 2025-05-07T19:44:42.3724996Z 2025-05-07T19:44:42.3725006Z 2025-05-07T19:44:42.3725017Z 2025-05-07T19:44:42.3725027Z 2025-05-07T19:44:42.3725038Z 2025-05-07T19:44:42.3725760Z zstd-1.5.6 | 542 KB | | 0%  2025-05-07T19:44:42.3726586Z 2025-05-07T19:44:42.3726597Z 2025-05-07T19:44:42.3726608Z 2025-05-07T19:44:42.3726618Z 2025-05-07T19:44:42.3726628Z 2025-05-07T19:44:42.3726639Z 2025-05-07T19:44:42.3726649Z 2025-05-07T19:44:42.3726661Z 2025-05-07T19:44:42.3726671Z 2025-05-07T19:44:42.3726682Z 2025-05-07T19:44:42.3727513Z libcxxabi-19.1.7 | 158 KB | | 0%  2025-05-07T19:44:42.3728416Z 2025-05-07T19:44:42.3728427Z 2025-05-07T19:44:42.3728437Z 2025-05-07T19:44:42.3728448Z 2025-05-07T19:44:42.3728459Z 2025-05-07T19:44:42.3728469Z 2025-05-07T19:44:42.3728480Z 2025-05-07T19:44:42.3728490Z 2025-05-07T19:44:42.3728500Z 2025-05-07T19:44:42.3728511Z 2025-05-07T19:44:42.3728521Z 2025-05-07T19:44:42.3729282Z clang-16.0.6 | 110 KB | | 0%  2025-05-07T19:44:42.3730118Z 2025-05-07T19:44:42.3730129Z 2025-05-07T19:44:42.3730140Z 2025-05-07T19:44:42.3730306Z 2025-05-07T19:44:42.3730320Z 2025-05-07T19:44:42.3730330Z 2025-05-07T19:44:42.3730341Z 2025-05-07T19:44:42.3730351Z 2025-05-07T19:44:42.3730361Z 2025-05-07T19:44:42.3730372Z 2025-05-07T19:44:42.3730382Z 2025-05-07T19:44:42.3730457Z 2025-05-07T19:44:42.3731619Z clangxx-16.0.6 | 110 KB | | 0%  2025-05-07T19:44:42.3732507Z 2025-05-07T19:44:42.3732519Z 2025-05-07T19:44:42.3732543Z 2025-05-07T19:44:42.3732554Z 2025-05-07T19:44:42.3732564Z 2025-05-07T19:44:42.3732575Z 2025-05-07T19:44:42.3732585Z 2025-05-07T19:44:42.3732627Z 2025-05-07T19:44:42.3732638Z 2025-05-07T19:44:42.3732648Z 2025-05-07T19:44:42.3732659Z 2025-05-07T19:44:42.3732670Z 2025-05-07T19:44:42.3732680Z 2025-05-07T19:44:42.3733642Z compiler-rt-16.0.6 | 107 KB | | 0%  2025-05-07T19:44:42.3733972Z 2025-05-07T19:44:42.3733976Z 2025-05-07T19:44:42.3733980Z 2025-05-07T19:44:42.3733983Z 2025-05-07T19:44:42.3734020Z 2025-05-07T19:44:42.3734023Z 2025-05-07T19:44:42.3734027Z 2025-05-07T19:44:42.3734031Z 2025-05-07T19:44:42.3734035Z 2025-05-07T19:44:42.3734039Z 2025-05-07T19:44:42.3734042Z 2025-05-07T19:44:42.3734046Z 2025-05-07T19:44:42.3734049Z 2025-05-07T19:44:42.3734053Z 2025-05-07T19:44:42.3734323Z zlib-1.2.13 | 91 KB | | 0%  2025-05-07T19:44:42.3734656Z 2025-05-07T19:44:42.3734664Z 2025-05-07T19:44:42.3734668Z 2025-05-07T19:44:42.3734671Z 2025-05-07T19:44:42.3734675Z 2025-05-07T19:44:42.3734678Z 2025-05-07T19:44:42.3734682Z 2025-05-07T19:44:42.3734686Z 2025-05-07T19:44:42.3734689Z 2025-05-07T19:44:42.3734692Z 2025-05-07T19:44:42.3734696Z 2025-05-07T19:44:42.3734699Z 2025-05-07T19:44:42.3734703Z 2025-05-07T19:44:42.3734707Z 2025-05-07T19:44:42.3734716Z 2025-05-07T19:44:42.7484424Z libzlib-1.2.13 | 60 KB | | 0%  2025-05-07T19:44:42.7484791Z 2025-05-07T19:44:42.7484819Z 2025-05-07T19:44:42.7484824Z 2025-05-07T19:44:42.7484828Z 2025-05-07T19:44:42.7787988Z icu-73.2 | 11.5 MB | | 0%  2025-05-07T19:44:42.7788287Z 2025-05-07T19:44:42.7933040Z compiler-rt_linux-64 | 36.0 MB | | 0%  2025-05-07T19:44:42.7933925Z 2025-05-07T19:44:42.7933954Z 2025-05-07T19:44:42.8220980Z libllvm16-16.0.6 | 33.7 MB | | 0%  2025-05-07T19:44:42.8222221Z 2025-05-07T19:44:42.8222227Z 2025-05-07T19:44:42.8222231Z 2025-05-07T19:44:42.8261876Z libclang-cpp16-16.0. | 17.3 MB | | 0%  2025-05-07T19:44:42.8483338Z llvm-openmp-16.0.6 | 39.9 MB | | 0% 2025-05-07T19:44:42.8483684Z 2025-05-07T19:44:42.8483689Z 2025-05-07T19:44:42.8483692Z 2025-05-07T19:44:42.8483696Z 2025-05-07T19:44:42.8787730Z icu-73.2 | 11.5 MB | ###5 | 36%  2025-05-07T19:44:42.8788056Z 2025-05-07T19:44:42.8932633Z compiler-rt_linux-64 | 36.0 MB | ##5 | 26%  2025-05-07T19:44:42.8933142Z 2025-05-07T19:44:42.8933245Z 2025-05-07T19:44:42.9223682Z libllvm16-16.0.6 | 33.7 MB | #1 | 12%  2025-05-07T19:44:42.9224091Z 2025-05-07T19:44:42.9224097Z 2025-05-07T19:44:42.9224101Z 2025-05-07T19:44:42.9262133Z libclang-cpp16-16.0. | 17.3 MB | ##6 | 27%  2025-05-07T19:44:42.9483296Z llvm-openmp-16.0.6 | 39.9 MB | #3 | 13% 2025-05-07T19:44:42.9483673Z 2025-05-07T19:44:42.9483679Z 2025-05-07T19:44:42.9483684Z 2025-05-07T19:44:42.9483698Z 2025-05-07T19:44:42.9786470Z icu-73.2 | 11.5 MB | ########9 | 89%  2025-05-07T19:44:42.9786761Z 2025-05-07T19:44:42.9932415Z compiler-rt_linux-64 | 36.0 MB | ####2 | 43%  2025-05-07T19:44:42.9932752Z 2025-05-07T19:44:42.9932759Z 2025-05-07T19:44:43.0267448Z libllvm16-16.0.6 | 33.7 MB | ###2 | 32%  2025-05-07T19:44:43.0307236Z llvm-openmp-16.0.6 | 39.9 MB | ##8 | 28% 2025-05-07T19:44:43.0307750Z 2025-05-07T19:44:43.0307757Z 2025-05-07T19:44:43.0307761Z 2025-05-07T19:44:43.0711545Z libclang-cpp16-16.0. | 17.3 MB | ####7 | 48%  2025-05-07T19:44:43.0711889Z 2025-05-07T19:44:43.0711895Z 2025-05-07T19:44:43.0711901Z 2025-05-07T19:44:43.0711908Z 2025-05-07T19:44:43.0788878Z icu-73.2 | 11.5 MB | ########## | 100%  2025-05-07T19:44:43.0789178Z 2025-05-07T19:44:43.0935497Z compiler-rt_linux-64 | 36.0 MB | #######8 | 78%  2025-05-07T19:44:43.0936396Z 2025-05-07T19:44:43.0936414Z 2025-05-07T19:44:43.1150131Z libllvm16-16.0.6 | 33.7 MB | #####4 | 54%  2025-05-07T19:44:43.1150427Z 2025-05-07T19:44:43.1150607Z 2025-05-07T19:44:43.1150615Z 2025-05-07T19:44:43.1150687Z 2025-05-07T19:44:43.1150693Z 2025-05-07T19:44:43.1267563Z libcxx-19.1.7 | 1000 KB | 1 | 2%  2025-05-07T19:44:43.1372266Z llvm-openmp-16.0.6 | 39.9 MB | ####4 | 45% 2025-05-07T19:44:43.1372770Z 2025-05-07T19:44:43.1372977Z 2025-05-07T19:44:43.1372984Z 2025-05-07T19:44:43.1510121Z libclang-cpp16-16.0. | 17.3 MB | ####### | 71%  2025-05-07T19:44:43.1510434Z 2025-05-07T19:44:43.1510439Z 2025-05-07T19:44:43.1510443Z 2025-05-07T19:44:43.1510446Z 2025-05-07T19:44:43.1510566Z 2025-05-07T19:44:43.1901592Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:43.1901903Z 2025-05-07T19:44:43.1902051Z 2025-05-07T19:44:43.1902080Z 2025-05-07T19:44:43.1902085Z 2025-05-07T19:44:43.1902097Z 2025-05-07T19:44:43.1902287Z 2025-05-07T19:44:43.2194619Z clang-16-16.0.6 | 780 KB | 2 | 2%  2025-05-07T19:44:43.2194942Z 2025-05-07T19:44:43.2194947Z 2025-05-07T19:44:43.2210433Z libllvm16-16.0.6 | 33.7 MB | ####### | 71%  2025-05-07T19:44:43.2210772Z 2025-05-07T19:44:43.2210777Z 2025-05-07T19:44:43.2210782Z 2025-05-07T19:44:43.2210787Z 2025-05-07T19:44:43.2210793Z 2025-05-07T19:44:43.2210798Z 2025-05-07T19:44:43.2373058Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:43.2373372Z 2025-05-07T19:44:43.2373376Z 2025-05-07T19:44:43.2373380Z 2025-05-07T19:44:43.2559013Z libclang-cpp16-16.0. | 17.3 MB | #########7 | 97%  2025-05-07T19:44:43.2559334Z 2025-05-07T19:44:43.2559341Z 2025-05-07T19:44:43.2559351Z 2025-05-07T19:44:43.2559355Z 2025-05-07T19:44:43.2559359Z 2025-05-07T19:44:43.2559362Z 2025-05-07T19:44:43.2559471Z 2025-05-07T19:44:43.2665515Z libiconv-1.18 | 696 KB | 2 | 2%  2025-05-07T19:44:43.2829515Z llvm-openmp-16.0.6 | 39.9 MB | #####8 | 58% 2025-05-07T19:44:43.2830007Z 2025-05-07T19:44:43.2830021Z 2025-05-07T19:44:43.2830060Z 2025-05-07T19:44:43.2830064Z 2025-05-07T19:44:43.2830068Z 2025-05-07T19:44:43.2830073Z 2025-05-07T19:44:43.2830106Z 2025-05-07T19:44:43.3152056Z libiconv-1.18 | 696 KB | ########## | 100%  2025-05-07T19:44:43.3152407Z 2025-05-07T19:44:43.3152417Z 2025-05-07T19:44:43.3152463Z 2025-05-07T19:44:43.3152468Z 2025-05-07T19:44:43.3152543Z 2025-05-07T19:44:43.3157690Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:43.3157985Z 2025-05-07T19:44:43.3157990Z 2025-05-07T19:44:43.3158000Z 2025-05-07T19:44:43.3158005Z 2025-05-07T19:44:43.3158008Z 2025-05-07T19:44:43.3272180Z libcxx-19.1.7 | 1000 KB | ########## | 100%  2025-05-07T19:44:43.3272592Z 2025-05-07T19:44:43.3272625Z 2025-05-07T19:44:43.3272629Z 2025-05-07T19:44:43.3272632Z 2025-05-07T19:44:43.3272636Z 2025-05-07T19:44:43.3272639Z 2025-05-07T19:44:43.3272643Z 2025-05-07T19:44:43.3272647Z 2025-05-07T19:44:43.3275252Z libxml2-2.12.7 | 688 KB | 2 | 2%  2025-05-07T19:44:43.3275552Z 2025-05-07T19:44:43.3275563Z 2025-05-07T19:44:43.3637753Z libllvm16-16.0.6 | 33.7 MB | ########5 | 86%  2025-05-07T19:44:43.3638655Z 2025-05-07T19:44:43.3638669Z 2025-05-07T19:44:43.3638680Z 2025-05-07T19:44:43.3639169Z 2025-05-07T19:44:43.3639183Z 2025-05-07T19:44:43.3639194Z 2025-05-07T19:44:43.3639204Z 2025-05-07T19:44:43.3639214Z 2025-05-07T19:44:43.3667574Z libxml2-2.12.7 | 688 KB | ########## | 100%  2025-05-07T19:44:43.4102976Z llvm-openmp-16.0.6 | 39.9 MB | ####### | 71% 2025-05-07T19:44:43.4103824Z 2025-05-07T19:44:43.4103838Z 2025-05-07T19:44:43.4103850Z 2025-05-07T19:44:43.4103860Z 2025-05-07T19:44:43.4103906Z 2025-05-07T19:44:43.4103947Z 2025-05-07T19:44:43.4103957Z 2025-05-07T19:44:43.4103968Z 2025-05-07T19:44:43.4103978Z 2025-05-07T19:44:43.4272696Z zstd-1.5.6 | 542 KB | 2 | 3%  2025-05-07T19:44:43.4273033Z 2025-05-07T19:44:43.4273038Z 2025-05-07T19:44:43.4273041Z 2025-05-07T19:44:43.4273070Z 2025-05-07T19:44:43.4273073Z 2025-05-07T19:44:43.4273078Z 2025-05-07T19:44:43.4273083Z 2025-05-07T19:44:43.4273182Z 2025-05-07T19:44:43.4273192Z 2025-05-07T19:44:43.4598208Z zstd-1.5.6 | 542 KB | ########## | 100%  2025-05-07T19:44:43.4598542Z 2025-05-07T19:44:43.4598559Z 2025-05-07T19:44:43.4598563Z 2025-05-07T19:44:43.4598567Z 2025-05-07T19:44:43.4598570Z 2025-05-07T19:44:43.4598574Z 2025-05-07T19:44:43.4598577Z 2025-05-07T19:44:43.4601754Z libiconv-1.18 | 696 KB | ########## | 100%  2025-05-07T19:44:43.4602073Z 2025-05-07T19:44:43.4602077Z 2025-05-07T19:44:43.4602101Z 2025-05-07T19:44:43.4602105Z 2025-05-07T19:44:43.4602109Z 2025-05-07T19:44:43.4602114Z 2025-05-07T19:44:43.4602194Z 2025-05-07T19:44:43.4612825Z libiconv-1.18 | 696 KB | ########## | 100%  2025-05-07T19:44:43.4613141Z 2025-05-07T19:44:43.4613158Z 2025-05-07T19:44:43.4613184Z 2025-05-07T19:44:43.4613188Z 2025-05-07T19:44:43.4613192Z 2025-05-07T19:44:43.4613196Z 2025-05-07T19:44:43.4613199Z 2025-05-07T19:44:43.4613203Z 2025-05-07T19:44:43.4613207Z 2025-05-07T19:44:43.4613328Z 2025-05-07T19:44:43.4670684Z libcxxabi-19.1.7 | 158 KB | # | 10%  2025-05-07T19:44:43.4671659Z 2025-05-07T19:44:43.4671663Z 2025-05-07T19:44:43.4671667Z 2025-05-07T19:44:43.4671671Z 2025-05-07T19:44:43.4671674Z 2025-05-07T19:44:43.4671678Z 2025-05-07T19:44:43.4671681Z 2025-05-07T19:44:43.4671685Z 2025-05-07T19:44:43.4671689Z 2025-05-07T19:44:43.4671692Z 2025-05-07T19:44:43.4930978Z libcxxabi-19.1.7 | 158 KB | ########## | 100%  2025-05-07T19:44:43.5103088Z llvm-openmp-16.0.6 | 39.9 MB | ########2 | 83% 2025-05-07T19:44:43.5103576Z 2025-05-07T19:44:43.5103638Z 2025-05-07T19:44:43.5103644Z 2025-05-07T19:44:43.5103679Z 2025-05-07T19:44:43.5103686Z 2025-05-07T19:44:43.5103690Z 2025-05-07T19:44:43.5103714Z 2025-05-07T19:44:43.5103718Z 2025-05-07T19:44:43.5103781Z 2025-05-07T19:44:43.5103785Z 2025-05-07T19:44:43.5137712Z 2025-05-07T19:44:43.5138586Z clang-16.0.6 | 110 KB | #4 | 15%  2025-05-07T19:44:43.5138935Z 2025-05-07T19:44:43.5138939Z 2025-05-07T19:44:43.5138943Z 2025-05-07T19:44:43.5138946Z 2025-05-07T19:44:43.5138970Z 2025-05-07T19:44:43.5138974Z 2025-05-07T19:44:43.5138978Z 2025-05-07T19:44:43.5138981Z 2025-05-07T19:44:43.5138984Z 2025-05-07T19:44:43.5138988Z 2025-05-07T19:44:43.5138992Z 2025-05-07T19:44:43.5506236Z clang-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:43.5507201Z 2025-05-07T19:44:43.5507252Z 2025-05-07T19:44:43.5507263Z 2025-05-07T19:44:43.5507274Z 2025-05-07T19:44:43.5507285Z 2025-05-07T19:44:43.5507296Z 2025-05-07T19:44:43.5507307Z 2025-05-07T19:44:43.5507317Z 2025-05-07T19:44:43.5507328Z 2025-05-07T19:44:43.5507338Z 2025-05-07T19:44:43.5507349Z 2025-05-07T19:44:43.5507360Z 2025-05-07T19:44:43.5542641Z clangxx-16.0.6 | 110 KB | #4 | 15%  2025-05-07T19:44:43.5543017Z 2025-05-07T19:44:43.5543022Z 2025-05-07T19:44:43.5543025Z 2025-05-07T19:44:43.5543223Z 2025-05-07T19:44:43.5543228Z 2025-05-07T19:44:43.5543231Z 2025-05-07T19:44:43.5543235Z 2025-05-07T19:44:43.5543238Z 2025-05-07T19:44:43.5543242Z 2025-05-07T19:44:43.5543245Z 2025-05-07T19:44:43.5543249Z 2025-05-07T19:44:43.5543252Z 2025-05-07T19:44:43.5590972Z clangxx-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:43.5591377Z 2025-05-07T19:44:43.5591531Z 2025-05-07T19:44:43.5591556Z 2025-05-07T19:44:43.5591561Z 2025-05-07T19:44:43.5591566Z 2025-05-07T19:44:43.5591570Z 2025-05-07T19:44:43.5591575Z 2025-05-07T19:44:43.5591580Z 2025-05-07T19:44:43.5592076Z libxml2-2.12.7 | 688 KB | ########## | 100%  2025-05-07T19:44:43.5592393Z 2025-05-07T19:44:43.5592396Z 2025-05-07T19:44:43.5592400Z 2025-05-07T19:44:43.5592404Z 2025-05-07T19:44:43.5592407Z 2025-05-07T19:44:43.5592410Z 2025-05-07T19:44:43.5592414Z 2025-05-07T19:44:43.5592417Z 2025-05-07T19:44:43.5745420Z libxml2-2.12.7 | 688 KB | ########## | 100%  2025-05-07T19:44:43.5745746Z 2025-05-07T19:44:43.5745900Z 2025-05-07T19:44:43.5745910Z 2025-05-07T19:44:43.6024380Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100%  2025-05-07T19:44:43.6024709Z 2025-05-07T19:44:43.6024716Z 2025-05-07T19:44:43.6024722Z 2025-05-07T19:44:43.6024729Z 2025-05-07T19:44:43.6024748Z 2025-05-07T19:44:43.6024753Z 2025-05-07T19:44:43.6024759Z 2025-05-07T19:44:43.6024801Z 2025-05-07T19:44:43.6024805Z 2025-05-07T19:44:43.6024809Z 2025-05-07T19:44:43.6024812Z 2025-05-07T19:44:43.6024816Z 2025-05-07T19:44:43.6024821Z 2025-05-07T19:44:43.6055943Z compiler-rt-16.0.6 | 107 KB | #4 | 15%  2025-05-07T19:44:43.6056350Z 2025-05-07T19:44:43.6056356Z 2025-05-07T19:44:43.6056361Z 2025-05-07T19:44:43.6056364Z 2025-05-07T19:44:43.6056368Z 2025-05-07T19:44:43.6056372Z 2025-05-07T19:44:43.6056375Z 2025-05-07T19:44:43.6056379Z 2025-05-07T19:44:43.6056384Z 2025-05-07T19:44:43.6056655Z zstd-1.5.6 | 542 KB | ########## | 100%  2025-05-07T19:44:43.6056961Z 2025-05-07T19:44:43.6056964Z 2025-05-07T19:44:43.6056969Z 2025-05-07T19:44:43.6056974Z 2025-05-07T19:44:43.6056979Z 2025-05-07T19:44:43.6056984Z 2025-05-07T19:44:43.6056989Z 2025-05-07T19:44:43.6056995Z 2025-05-07T19:44:43.6057000Z 2025-05-07T19:44:43.6057004Z 2025-05-07T19:44:43.6057010Z 2025-05-07T19:44:43.6057015Z 2025-05-07T19:44:43.6057215Z 2025-05-07T19:44:43.6059362Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:43.6059689Z 2025-05-07T19:44:43.6059692Z 2025-05-07T19:44:43.6059696Z 2025-05-07T19:44:43.6059700Z 2025-05-07T19:44:43.6059703Z 2025-05-07T19:44:43.6059707Z 2025-05-07T19:44:43.6059717Z 2025-05-07T19:44:43.6059720Z 2025-05-07T19:44:43.6059724Z 2025-05-07T19:44:43.6115930Z zstd-1.5.6 | 542 KB | ########## | 100%  2025-05-07T19:44:43.6336750Z llvm-openmp-16.0.6 | 39.9 MB | #########8 | 98% 2025-05-07T19:44:43.6337119Z 2025-05-07T19:44:43.6337123Z 2025-05-07T19:44:43.6337139Z 2025-05-07T19:44:43.6337143Z 2025-05-07T19:44:43.6337146Z 2025-05-07T19:44:43.6337150Z 2025-05-07T19:44:43.6337153Z 2025-05-07T19:44:43.6337157Z 2025-05-07T19:44:43.6337161Z 2025-05-07T19:44:43.6337164Z 2025-05-07T19:44:43.6337168Z 2025-05-07T19:44:43.6337172Z 2025-05-07T19:44:43.6337175Z 2025-05-07T19:44:43.6337188Z 2025-05-07T19:44:43.6360969Z zlib-1.2.13 | 91 KB | #7 | 18%  2025-05-07T19:44:43.6361290Z 2025-05-07T19:44:43.6361295Z 2025-05-07T19:44:43.6361298Z 2025-05-07T19:44:43.6361302Z 2025-05-07T19:44:43.6361305Z 2025-05-07T19:44:43.6361309Z 2025-05-07T19:44:43.6361312Z 2025-05-07T19:44:43.6361316Z 2025-05-07T19:44:43.6361320Z 2025-05-07T19:44:43.6361323Z 2025-05-07T19:44:43.6361327Z 2025-05-07T19:44:43.6361330Z 2025-05-07T19:44:43.6361334Z 2025-05-07T19:44:43.6361343Z 2025-05-07T19:44:43.6397949Z zlib-1.2.13 | 91 KB | ########## | 100%  2025-05-07T19:44:43.6398291Z 2025-05-07T19:44:43.6398295Z 2025-05-07T19:44:43.6398299Z 2025-05-07T19:44:43.6398303Z 2025-05-07T19:44:43.6398306Z 2025-05-07T19:44:43.6398310Z 2025-05-07T19:44:43.6398314Z 2025-05-07T19:44:43.6398317Z 2025-05-07T19:44:43.6398321Z 2025-05-07T19:44:43.6398518Z 2025-05-07T19:44:43.6401639Z libcxxabi-19.1.7 | 158 KB | ########## | 100%  2025-05-07T19:44:43.6401967Z 2025-05-07T19:44:43.6401971Z 2025-05-07T19:44:43.6401974Z 2025-05-07T19:44:43.6401977Z 2025-05-07T19:44:43.6401990Z 2025-05-07T19:44:43.6401993Z 2025-05-07T19:44:43.6402018Z 2025-05-07T19:44:43.6402021Z 2025-05-07T19:44:43.6402025Z 2025-05-07T19:44:43.6402028Z 2025-05-07T19:44:43.6503242Z libcxxabi-19.1.7 | 158 KB | ########## | 100%  2025-05-07T19:44:43.6503592Z 2025-05-07T19:44:43.6503597Z 2025-05-07T19:44:43.6503614Z 2025-05-07T19:44:43.6503641Z 2025-05-07T19:44:43.6503645Z 2025-05-07T19:44:43.6503649Z 2025-05-07T19:44:43.6503652Z 2025-05-07T19:44:43.6503655Z 2025-05-07T19:44:43.6503659Z 2025-05-07T19:44:43.6503662Z 2025-05-07T19:44:43.6503666Z 2025-05-07T19:44:43.6503669Z 2025-05-07T19:44:43.6503673Z 2025-05-07T19:44:43.6503676Z 2025-05-07T19:44:43.6503680Z 2025-05-07T19:44:43.6515736Z libzlib-1.2.13 | 60 KB | ##6 | 27%  2025-05-07T19:44:43.6516087Z 2025-05-07T19:44:43.6516090Z 2025-05-07T19:44:43.6516109Z 2025-05-07T19:44:43.6516113Z 2025-05-07T19:44:43.6516116Z 2025-05-07T19:44:43.6516120Z 2025-05-07T19:44:43.6516124Z 2025-05-07T19:44:43.6516127Z 2025-05-07T19:44:43.6516131Z 2025-05-07T19:44:43.6516135Z 2025-05-07T19:44:43.6516139Z 2025-05-07T19:44:43.6516142Z 2025-05-07T19:44:43.6516146Z 2025-05-07T19:44:43.6516149Z 2025-05-07T19:44:43.6516153Z 2025-05-07T19:44:43.7167035Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:43.7167390Z 2025-05-07T19:44:43.7167395Z 2025-05-07T19:44:43.7167399Z 2025-05-07T19:44:43.7167404Z 2025-05-07T19:44:43.7167408Z 2025-05-07T19:44:43.7167411Z 2025-05-07T19:44:43.7170877Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:43.7171197Z 2025-05-07T19:44:43.7171202Z 2025-05-07T19:44:43.7171206Z 2025-05-07T19:44:43.7171211Z 2025-05-07T19:44:43.7171492Z 2025-05-07T19:44:43.7171505Z 2025-05-07T19:44:43.7442105Z clang-16-16.0.6 | 780 KB | ########## | 100%  2025-05-07T19:44:43.7442409Z 2025-05-07T19:44:43.7442426Z 2025-05-07T19:44:43.7442430Z 2025-05-07T19:44:43.7442435Z 2025-05-07T19:44:43.7442438Z 2025-05-07T19:44:43.7442445Z 2025-05-07T19:44:43.7442449Z 2025-05-07T19:44:43.7442452Z 2025-05-07T19:44:43.7442456Z 2025-05-07T19:44:43.7442459Z 2025-05-07T19:44:43.7442471Z 2025-05-07T19:44:43.7445642Z clang-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:43.7445952Z 2025-05-07T19:44:43.7445956Z 2025-05-07T19:44:43.7445960Z 2025-05-07T19:44:43.7445963Z 2025-05-07T19:44:43.7445967Z 2025-05-07T19:44:43.7445970Z 2025-05-07T19:44:43.7445974Z 2025-05-07T19:44:43.7445977Z 2025-05-07T19:44:43.7445982Z 2025-05-07T19:44:43.7445985Z 2025-05-07T19:44:43.7445996Z 2025-05-07T19:44:43.8033537Z clang-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:43.8033894Z 2025-05-07T19:44:43.8305493Z compiler-rt_linux-64 | 36.0 MB | ########## | 100%  2025-05-07T19:44:43.8305786Z 2025-05-07T19:44:43.8305931Z 2025-05-07T19:44:43.8306184Z 2025-05-07T19:44:43.8306199Z 2025-05-07T19:44:43.8306207Z 2025-05-07T19:44:43.8306213Z 2025-05-07T19:44:43.8306219Z 2025-05-07T19:44:43.8306225Z 2025-05-07T19:44:43.8306231Z 2025-05-07T19:44:43.8306237Z 2025-05-07T19:44:43.8306242Z 2025-05-07T19:44:43.8306248Z 2025-05-07T19:44:43.8308525Z clangxx-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:43.8308879Z 2025-05-07T19:44:43.8308898Z 2025-05-07T19:44:43.8308902Z 2025-05-07T19:44:43.8308907Z 2025-05-07T19:44:43.8308912Z 2025-05-07T19:44:43.8308917Z 2025-05-07T19:44:43.8308922Z 2025-05-07T19:44:43.8308928Z 2025-05-07T19:44:43.8308957Z 2025-05-07T19:44:43.8308961Z 2025-05-07T19:44:43.8308966Z 2025-05-07T19:44:43.8308969Z 2025-05-07T19:44:43.8555029Z clangxx-16.0.6 | 110 KB | ########## | 100%  2025-05-07T19:44:43.8555388Z 2025-05-07T19:44:43.8555393Z 2025-05-07T19:44:43.8555398Z 2025-05-07T19:44:43.8555401Z 2025-05-07T19:44:43.8561691Z icu-73.2 | 11.5 MB | ########## | 100%  2025-05-07T19:44:43.8561952Z 2025-05-07T19:44:43.8561961Z 2025-05-07T19:44:43.8704156Z libllvm16-16.0.6 | 33.7 MB | ########## | 100%  2025-05-07T19:44:43.8705023Z 2025-05-07T19:44:43.8705036Z 2025-05-07T19:44:43.8705046Z 2025-05-07T19:44:43.8705056Z 2025-05-07T19:44:43.8705098Z 2025-05-07T19:44:43.8705109Z 2025-05-07T19:44:43.8705119Z 2025-05-07T19:44:43.8705130Z 2025-05-07T19:44:43.8705140Z 2025-05-07T19:44:43.8705150Z 2025-05-07T19:44:43.8705190Z 2025-05-07T19:44:43.8705200Z 2025-05-07T19:44:43.8705211Z 2025-05-07T19:44:43.8706100Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:43.8707049Z 2025-05-07T19:44:43.8707060Z 2025-05-07T19:44:43.8707085Z 2025-05-07T19:44:43.8707095Z 2025-05-07T19:44:43.8707106Z 2025-05-07T19:44:43.8707116Z 2025-05-07T19:44:43.8707126Z 2025-05-07T19:44:43.8707163Z 2025-05-07T19:44:43.8707173Z 2025-05-07T19:44:43.8707183Z 2025-05-07T19:44:43.8707194Z 2025-05-07T19:44:43.8707204Z 2025-05-07T19:44:43.8707215Z 2025-05-07T19:44:43.8745184Z compiler-rt-16.0.6 | 107 KB | ########## | 100%  2025-05-07T19:44:43.8746283Z 2025-05-07T19:44:43.8746297Z 2025-05-07T19:44:43.8746308Z 2025-05-07T19:44:43.8746319Z 2025-05-07T19:44:43.8746360Z 2025-05-07T19:44:43.8746372Z 2025-05-07T19:44:43.8746382Z 2025-05-07T19:44:43.8746393Z 2025-05-07T19:44:43.8746403Z 2025-05-07T19:44:43.8746414Z 2025-05-07T19:44:43.8746424Z 2025-05-07T19:44:43.8746434Z 2025-05-07T19:44:43.8746445Z 2025-05-07T19:44:43.8746456Z 2025-05-07T19:44:43.8747254Z zlib-1.2.13 | 91 KB | ########## | 100%  2025-05-07T19:44:43.8748149Z 2025-05-07T19:44:43.8748549Z 2025-05-07T19:44:43.8748560Z 2025-05-07T19:44:43.8748571Z 2025-05-07T19:44:43.8748581Z 2025-05-07T19:44:43.8748591Z 2025-05-07T19:44:43.8748601Z 2025-05-07T19:44:43.8748612Z 2025-05-07T19:44:43.8748622Z 2025-05-07T19:44:43.8748633Z 2025-05-07T19:44:43.8748643Z 2025-05-07T19:44:43.8748654Z 2025-05-07T19:44:43.8748664Z 2025-05-07T19:44:43.8748675Z 2025-05-07T19:44:43.8869132Z zlib-1.2.13 | 91 KB | ########## | 100%  2025-05-07T19:44:43.8870098Z 2025-05-07T19:44:43.8870113Z 2025-05-07T19:44:43.8870155Z 2025-05-07T19:44:43.8870167Z 2025-05-07T19:44:43.8870178Z 2025-05-07T19:44:43.8870189Z 2025-05-07T19:44:43.8870199Z 2025-05-07T19:44:43.8870210Z 2025-05-07T19:44:43.8870220Z 2025-05-07T19:44:43.8870231Z 2025-05-07T19:44:43.8870241Z 2025-05-07T19:44:43.8870251Z 2025-05-07T19:44:43.8870291Z 2025-05-07T19:44:43.8870301Z 2025-05-07T19:44:43.8870312Z 2025-05-07T19:44:43.8871162Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:43.8872092Z 2025-05-07T19:44:43.8872102Z 2025-05-07T19:44:43.8872113Z 2025-05-07T19:44:43.8872123Z 2025-05-07T19:44:43.8872133Z 2025-05-07T19:44:43.8872143Z 2025-05-07T19:44:43.8872153Z 2025-05-07T19:44:43.8872183Z 2025-05-07T19:44:43.8872194Z 2025-05-07T19:44:43.8872204Z 2025-05-07T19:44:43.8872215Z 2025-05-07T19:44:43.8872225Z 2025-05-07T19:44:43.8872235Z 2025-05-07T19:44:43.8872245Z 2025-05-07T19:44:43.8872256Z 2025-05-07T19:44:43.9957300Z libzlib-1.2.13 | 60 KB | ########## | 100%  2025-05-07T19:44:44.1216226Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100% 2025-05-07T19:44:44.1217074Z 2025-05-07T19:44:44.1217089Z 2025-05-07T19:44:44.1217101Z 2025-05-07T19:44:44.3032335Z libclang-cpp16-16.0. | 17.3 MB | ########## | 100%  2025-05-07T19:44:44.3032838Z 2025-05-07T19:44:44.3769797Z compiler-rt_linux-64 | 36.0 MB | ########## | 100%  2025-05-07T19:44:44.3770690Z 2025-05-07T19:44:44.3770703Z 2025-05-07T19:44:44.5604466Z libllvm16-16.0.6 | 33.7 MB | ########## | 100%  2025-05-07T19:44:44.5609646Z llvm-openmp-16.0.6 | 39.9 MB | ########## | 100% 2025-05-07T19:44:44.5610051Z 2025-05-07T19:44:44.5610417Z 2025-05-07T19:44:44.5610742Z  2025-05-07T19:44:44.5610997Z 2025-05-07T19:44:44.5611027Z 2025-05-07T19:44:44.5611235Z  2025-05-07T19:44:44.5611467Z 2025-05-07T19:44:44.5611498Z 2025-05-07T19:44:44.5611502Z 2025-05-07T19:44:44.5611685Z  2025-05-07T19:44:44.5611919Z 2025-05-07T19:44:44.5611923Z 2025-05-07T19:44:44.5611926Z 2025-05-07T19:44:44.5611930Z 2025-05-07T19:44:44.5612149Z  2025-05-07T19:44:44.5612379Z 2025-05-07T19:44:44.5612382Z 2025-05-07T19:44:44.5612394Z 2025-05-07T19:44:44.5612397Z 2025-05-07T19:44:44.5612400Z 2025-05-07T19:44:44.5612593Z  2025-05-07T19:44:44.5612858Z 2025-05-07T19:44:44.5612862Z 2025-05-07T19:44:44.5612865Z 2025-05-07T19:44:44.5612869Z 2025-05-07T19:44:44.5612872Z 2025-05-07T19:44:44.5612876Z 2025-05-07T19:44:44.5613066Z  2025-05-07T19:44:44.5613330Z 2025-05-07T19:44:44.5613333Z 2025-05-07T19:44:44.5613336Z 2025-05-07T19:44:44.5613344Z 2025-05-07T19:44:44.5613348Z 2025-05-07T19:44:44.5613351Z 2025-05-07T19:44:44.5613355Z 2025-05-07T19:44:44.5613547Z  2025-05-07T19:44:44.5613785Z 2025-05-07T19:44:44.5613789Z 2025-05-07T19:44:44.5613793Z 2025-05-07T19:44:44.5613820Z 2025-05-07T19:44:44.5613824Z 2025-05-07T19:44:44.5613827Z 2025-05-07T19:44:44.5613831Z 2025-05-07T19:44:44.5613834Z 2025-05-07T19:44:44.5614243Z  2025-05-07T19:44:44.5614485Z 2025-05-07T19:44:44.5614488Z 2025-05-07T19:44:44.5614493Z 2025-05-07T19:44:44.5614496Z 2025-05-07T19:44:44.5614499Z 2025-05-07T19:44:44.5614527Z 2025-05-07T19:44:44.5614531Z 2025-05-07T19:44:44.5614534Z 2025-05-07T19:44:44.5614537Z 2025-05-07T19:44:44.5614750Z  2025-05-07T19:44:44.5614991Z 2025-05-07T19:44:44.5614995Z 2025-05-07T19:44:44.5615003Z 2025-05-07T19:44:44.5615007Z 2025-05-07T19:44:44.5615010Z 2025-05-07T19:44:44.5615093Z 2025-05-07T19:44:44.5615097Z 2025-05-07T19:44:44.5615101Z 2025-05-07T19:44:44.5615104Z 2025-05-07T19:44:44.5615108Z 2025-05-07T19:44:44.5615317Z  2025-05-07T19:44:44.5615590Z 2025-05-07T19:44:44.5615593Z 2025-05-07T19:44:44.5615597Z 2025-05-07T19:44:44.5615600Z 2025-05-07T19:44:44.5615608Z 2025-05-07T19:44:44.5615611Z 2025-05-07T19:44:44.5615614Z 2025-05-07T19:44:44.5615618Z 2025-05-07T19:44:44.5615621Z 2025-05-07T19:44:44.5615624Z 2025-05-07T19:44:44.5615628Z 2025-05-07T19:44:44.5615839Z  2025-05-07T19:44:44.5616115Z 2025-05-07T19:44:44.5616119Z 2025-05-07T19:44:44.5616122Z 2025-05-07T19:44:44.5616126Z 2025-05-07T19:44:44.5616129Z 2025-05-07T19:44:44.5616133Z 2025-05-07T19:44:44.5616136Z 2025-05-07T19:44:44.5616139Z 2025-05-07T19:44:44.5616272Z 2025-05-07T19:44:44.5616277Z 2025-05-07T19:44:44.5616280Z 2025-05-07T19:44:44.5616283Z 2025-05-07T19:44:44.5616499Z  2025-05-07T19:44:44.5616783Z 2025-05-07T19:44:44.5616786Z 2025-05-07T19:44:44.5616789Z 2025-05-07T19:44:44.5616793Z 2025-05-07T19:44:44.5616796Z 2025-05-07T19:44:44.5616799Z 2025-05-07T19:44:44.5616803Z 2025-05-07T19:44:44.5616810Z 2025-05-07T19:44:44.5616814Z 2025-05-07T19:44:44.5616817Z 2025-05-07T19:44:44.5616821Z 2025-05-07T19:44:44.5616824Z 2025-05-07T19:44:44.5616828Z 2025-05-07T19:44:44.5617071Z  2025-05-07T19:44:44.5617323Z 2025-05-07T19:44:44.5617327Z 2025-05-07T19:44:44.5617330Z 2025-05-07T19:44:44.5617333Z 2025-05-07T19:44:44.5617337Z 2025-05-07T19:44:44.5617340Z 2025-05-07T19:44:44.5617343Z 2025-05-07T19:44:44.5617347Z 2025-05-07T19:44:44.5617350Z 2025-05-07T19:44:44.5617356Z 2025-05-07T19:44:44.5617360Z 2025-05-07T19:44:44.5617364Z 2025-05-07T19:44:44.5617367Z 2025-05-07T19:44:44.5617370Z 2025-05-07T19:44:44.5617611Z  2025-05-07T19:44:44.5617872Z 2025-05-07T19:44:44.5617876Z 2025-05-07T19:44:44.5617880Z 2025-05-07T19:44:44.5617883Z 2025-05-07T19:44:44.5617886Z 2025-05-07T19:44:44.5617889Z 2025-05-07T19:44:44.5617897Z 2025-05-07T19:44:44.5617900Z 2025-05-07T19:44:44.5617903Z 2025-05-07T19:44:44.5617907Z 2025-05-07T19:44:44.5617910Z 2025-05-07T19:44:44.5617913Z 2025-05-07T19:44:44.5617944Z 2025-05-07T19:44:44.5617947Z 2025-05-07T19:44:44.5617950Z 2025-05-07T19:44:44.5618210Z  done 2025-05-07T19:44:44.6624567Z Preparing transaction: | done 2025-05-07T19:44:44.7632681Z Verifying transaction: - done 2025-05-07T19:44:44.8646267Z Executing transaction: | done 2025-05-07T19:44:44.9552260Z [INSTALL] Setting the C/C++ compiler symlinks ... 2025-05-07T19:44:48.6480337Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:48.6481354Z 2025-05-07T19:44:48.6499545Z 2025-05-07T19:44:48.6519588Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:44:48.6521224Z 2025-05-07T19:44:48.6532911Z 2025-05-07T19:44:48.6549471Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:44:48.6550053Z 2025-05-07T19:44:48.6562628Z 2025-05-07T19:44:48.6580158Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:44:48.6580762Z 2025-05-07T19:44:48.6593480Z 2025-05-07T19:44:48.6593992Z + conda env config vars set -n build_binary CC= 2025-05-07T19:44:48.6594712Z 2025-05-07T19:44:49.0714430Z 2025-05-07T19:44:49.0715077Z + conda env config vars set -n build_binary CXX= 2025-05-07T19:44:49.0715855Z 2025-05-07T19:44:49.4871654Z 2025-05-07T19:44:49.4872075Z + conda run -n build_binary printenv CC 2025-05-07T19:44:49.4872368Z 2025-05-07T19:44:51.2615581Z /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-cc 2025-05-07T19:44:51.2616030Z 2025-05-07T19:44:51.3189347Z 2025-05-07T19:44:51.3189922Z + conda run -n build_binary printenv CXX 2025-05-07T19:44:51.3190317Z 2025-05-07T19:44:53.0738667Z /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-c++ 2025-05-07T19:44:53.0739881Z 2025-05-07T19:44:53.1298321Z 2025-05-07T19:44:54.9910554Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/lib ... 2025-05-07T19:44:56.7833907Z ERROR conda.cli.main_run:execute(125): `conda run printenv LD_LIBRARY_PATH` failed. (See above for error) 2025-05-07T19:44:56.8436310Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib 2025-05-07T19:44:56.8436825Z 2025-05-07T19:44:57.2614464Z 2025-05-07T19:44:59.0548098Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:44:59.0548962Z 2025-05-07T19:44:59.1273055Z [CHECK] Binary cc found in PATH 2025-05-07T19:45:00.9155185Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:45:00.9155539Z 2025-05-07T19:45:00.9883790Z [CHECK] Binary gcc found in PATH 2025-05-07T19:45:02.7771873Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:45:02.7772307Z 2025-05-07T19:45:02.8496919Z [CHECK] Binary c++ found in PATH 2025-05-07T19:45:04.6513529Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:45:04.6513910Z 2025-05-07T19:45:04.7094763Z [CHECK] Binary g++ found in PATH 2025-05-07T19:45:04.7095290Z [INFO] Printing out all preprocessor defines in the C compiler ... 2025-05-07T19:45:04.7095779Z + conda run -n build_binary cc -dM -E - 2025-05-07T19:45:04.7096022Z 2025-05-07T19:45:06.5260897Z #define _LP64 1 2025-05-07T19:45:06.5261356Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:06.5261668Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:06.5261993Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:06.5262276Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:06.5262593Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:06.5262877Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:06.5263200Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:06.5263553Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:06.5263943Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:06.5264290Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:06.5264676Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:06.5265058Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:06.5265373Z #define __CHAR_BIT__ 8 2025-05-07T19:45:06.5265690Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:06.5266048Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:06.5266454Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:06.5266840Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:06.5267212Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:06.5267571Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:06.5267913Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:06.5268280Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:06.5268634Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:06.5269014Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:06.5269476Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:06.5270135Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:06.5270502Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:06.5270859Z #define __DBL_DIG__ 15 2025-05-07T19:45:06.5271174Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:06.5271523Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:06.5271837Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:06.5272132Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:06.5272455Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:06.5272870Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:06.5273193Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:06.5273500Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:06.5273875Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:06.5274216Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:06.5274517Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:06.5274890Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:06.5275220Z #define __ELF__ 1 2025-05-07T19:45:06.5275497Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:06.5275779Z #define __FLOAT128__ 1 2025-05-07T19:45:06.5276071Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:06.5276399Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:06.5276772Z #define __FLT16_DIG__ 3 2025-05-07T19:45:06.5277045Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:06.5277397Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:06.5277720Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:45:06.5278172Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:06.5278511Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:06.5278800Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:06.5279111Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:06.5279399Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:06.5279728Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:06.5280028Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:06.5280352Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:06.5280681Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:06.5281012Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:06.5281361Z #define __FLT_DIG__ 6 2025-05-07T19:45:06.5281631Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:06.5281978Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:06.5282278Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:06.5282601Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:06.5282902Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:06.5283217Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:06.5283506Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:06.5283818Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:06.5284127Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:06.5284456Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:06.5284777Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:06.5285185Z #define __FLT_RADIX__ 2 2025-05-07T19:45:06.5285469Z #define __FXSR__ 1 2025-05-07T19:45:06.5285725Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:06.5286077Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:06.5286408Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:06.5286778Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:06.5287115Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:06.5287456Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:06.5287779Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:06.5288127Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:06.5288486Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:06.5288827Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:06.5289215Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:06.5289559Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:06.5289923Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:06.5290255Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:06.5290648Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:06.5291007Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:06.5291480Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:06.5291845Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:06.5292125Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:06.5292457Z #define __GNUC_STDC_INLINE__ 1 2025-05-07T19:45:06.5292736Z #define __GNUC__ 4 2025-05-07T19:45:06.5293016Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:06.5293301Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:06.5293613Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:06.5293881Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:06.5294176Z #define __INT16_MAX__ 32767 2025-05-07T19:45:06.5294445Z #define __INT16_TYPE__ short 2025-05-07T19:45:06.5294752Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:06.5295050Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:06.5295313Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:06.5295609Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:06.5295895Z #define __INT32_TYPE__ int 2025-05-07T19:45:06.5296186Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:06.5296467Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:06.5296763Z #define __INT64_FMTi__ "li" 2025-05-07T19:45:06.5297048Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:06.5297403Z #define __INT64_TYPE__ long int 2025-05-07T19:45:06.5297686Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:06.5297983Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:06.5298287Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:06.5298559Z #define __INT8_MAX__ 127 2025-05-07T19:45:06.5298868Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:06.5299170Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:06.5299576Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:06.5299861Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:06.5300182Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:06.5300514Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:06.5300845Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:06.5301125Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:06.5301446Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:06.5301753Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:06.5302116Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:06.5302443Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:06.5302723Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:06.5303041Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:06.5303336Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:06.5303655Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:06.5303955Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:06.5304269Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:06.5304554Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:06.5304868Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:06.5305173Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:06.5305484Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:06.5305796Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:06.5306087Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:06.5306422Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:06.5306761Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:45:06.5307098Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:06.5307384Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:45:06.5307705Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:06.5307992Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:06.5308313Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:06.5308650Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:06.5308935Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:06.5309261Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:06.5309560Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:06.5309892Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:06.5310192Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:06.5310510Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:06.5310796Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:06.5311114Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:06.5311422Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:06.5311736Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:06.5312138Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:06.5312434Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:06.5312859Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:06.5313378Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:06.5313730Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:06.5314098Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:06.5314433Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:06.5314744Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:06.5315095Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:06.5315424Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:06.5315749Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:06.5316056Z #define __INT_WIDTH__ 32 2025-05-07T19:45:06.5316329Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:06.5316706Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:06.5317086Z #define __LDBL_DIG__ 18 2025-05-07T19:45:06.5317421Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:06.5317785Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:45:06.5318107Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:06.5318408Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:06.5318736Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:06.5319028Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:06.5319359Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:06.5319710Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:06.5320062Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:06.5320441Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:06.5320745Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:06.5321079Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:06.5321340Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:06.5321656Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:06.5322004Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:06.5322317Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:06.5322574Z #define __LP64__ 1 2025-05-07T19:45:06.5322797Z #define __MMX__ 1 2025-05-07T19:45:06.5323032Z #define __NO_INLINE__ 1 2025-05-07T19:45:06.5323275Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:06.5323584Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:06.5323906Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:06.5324264Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:06.5324582Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:06.5324928Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:06.5325365Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:06.5325700Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:06.5326036Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:06.5326355Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:06.5326671Z #define __PIC__ 2 2025-05-07T19:45:06.5326908Z #define __PIE__ 2 2025-05-07T19:45:06.5327195Z #define __POINTER_WIDTH__ 64 2025-05-07T19:45:06.5327492Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:06.5327838Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:45:06.5328127Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:06.5328450Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:06.5328777Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:06.5329108Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:06.5329414Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:06.5329687Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:06.5329934Z #define __SEG_FS 1 2025-05-07T19:45:06.5330141Z #define __SEG_GS 1 2025-05-07T19:45:06.5330367Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:06.5330612Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:06.5330873Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:06.5331157Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:06.5331799Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:06.5332059Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:06.5332452Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:06.5332754Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:06.5333012Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:06.5333432Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:06.5333714Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:06.5333991Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:06.5334242Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:06.5334559Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:06.5334855Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:06.5335174Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:06.5335469Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:06.5335793Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:06.5336107Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:06.5336383Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:06.5336697Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:06.5336974Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:06.5337297Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:06.5337635Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:45:06.5337981Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:06.5338246Z #define __SSE2_MATH__ 1 2025-05-07T19:45:06.5338526Z #define __SSE2__ 1 2025-05-07T19:45:06.5338771Z #define __SSE_MATH__ 1 2025-05-07T19:45:06.5339053Z #define __SSE__ 1 2025-05-07T19:45:06.5339294Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:06.5339589Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:06.5339885Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:06.5340163Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:06.5340474Z #define __STDC__ 1 2025-05-07T19:45:06.5340723Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:06.5341035Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:06.5341317Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:06.5341754Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:06.5342038Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:06.5342350Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:06.5342647Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:06.5343001Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:06.5343320Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:06.5343595Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:06.5343900Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:06.5344288Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:06.5344584Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:06.5344875Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:06.5345192Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:06.5345466Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:06.5345762Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:06.5346027Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:06.5346322Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:06.5346633Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:06.5346962Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:06.5347301Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:06.5347568Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:45:06.5347874Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:06.5348145Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:06.5348441Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:45:06.5348701Z #define __UINT8_MAX__ 255 2025-05-07T19:45:06.5348994Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:06.5349302Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:06.5349613Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:06.5349917Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:06.5350188Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:06.5350485Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:06.5350778Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:06.5351148Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:06.5351464Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:06.5351770Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:06.5352045Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:06.5352351Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:06.5352743Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:06.5353220Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:06.5353616Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:06.5354051Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:06.5354378Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:06.5354791Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:06.5355138Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:06.5355448Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:06.5355794Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:06.5356123Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:06.5356502Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:06.5356843Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:06.5357159Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:06.5357506Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:06.5357817Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:06.5358197Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:06.5358612Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:06.5358922Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:06.5359267Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:06.5359579Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:06.5359954Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:06.5360345Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:06.5360739Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:06.5361061Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:06.5361403Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:06.5361738Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:06.5362036Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:06.5362366Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:06.5362701Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:06.5363106Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:06.5363421Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:06.5363756Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:06.5364062Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:06.5364418Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:06.5364760Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:06.5365200Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:06.5365516Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:06.5365807Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:06.5366135Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:06.5366461Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:06.5366809Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:06.5367101Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:06.5367416Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:06.5367710Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:06.5368053Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:06.5368448Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:06.5368782Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:06.5369102Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:06.5369390Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:06.5369708Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:06.5369995Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:06.5370315Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:06.5370636Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:06.5371289Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:06.5371931Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:06.5372215Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:06.5372509Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:06.5372773Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:06.5373245Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:06.5373553Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:06.5373855Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:06.5374111Z #define __amd64 1 2025-05-07T19:45:06.5374375Z #define __amd64__ 1 2025-05-07T19:45:06.5374610Z #define __clang__ 1 2025-05-07T19:45:06.5374904Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:06.5375226Z #define __clang_major__ 16 2025-05-07T19:45:06.5375520Z #define __clang_minor__ 0 2025-05-07T19:45:06.5375817Z #define __clang_patchlevel__ 6 2025-05-07T19:45:06.5376516Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:06.5377225Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:06.5377579Z #define __code_model_small__ 1 2025-05-07T19:45:06.5377894Z #define __gnu_linux__ 1 2025-05-07T19:45:06.5378151Z #define __k8 1 2025-05-07T19:45:06.5378411Z #define __k8__ 1 2025-05-07T19:45:06.5378638Z #define __linux 1 2025-05-07T19:45:06.5378906Z #define __linux__ 1 2025-05-07T19:45:06.5379174Z #define __llvm__ 1 2025-05-07T19:45:06.5379407Z #define __pic__ 2 2025-05-07T19:45:06.5379661Z #define __pie__ 2 2025-05-07T19:45:06.5379947Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:45:06.5380370Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:06.5380721Z #define __tune_k8__ 1 2025-05-07T19:45:06.5380989Z #define __unix 1 2025-05-07T19:45:06.5381216Z #define __unix__ 1 2025-05-07T19:45:06.5381492Z #define __x86_64 1 2025-05-07T19:45:06.5381724Z #define __x86_64__ 1 2025-05-07T19:45:06.5381990Z #define linux 1 2025-05-07T19:45:06.5382224Z #define unix 1 2025-05-07T19:45:06.5382402Z 2025-05-07T19:45:06.5829059Z 2025-05-07T19:45:06.5829673Z [INFO] Printing out all preprocessor defines in the C++ compiler ... 2025-05-07T19:45:06.5830255Z + conda run -n build_binary c++ -dM -E -x c++ - 2025-05-07T19:45:06.5830518Z 2025-05-07T19:45:08.3834464Z #define _GNU_SOURCE 1 2025-05-07T19:45:08.3834860Z #define _LP64 1 2025-05-07T19:45:08.3835502Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:45:08.3835803Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:45:08.3836129Z #define __ATOMIC_CONSUME 1 2025-05-07T19:45:08.3836417Z #define __ATOMIC_RELAXED 0 2025-05-07T19:45:08.3836857Z #define __ATOMIC_RELEASE 3 2025-05-07T19:45:08.3837139Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:45:08.3837464Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:45:08.3837779Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:45:08.3838129Z #define __BOOL_WIDTH__ 8 2025-05-07T19:45:08.3838462Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:45:08.3838853Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:45:08.3839217Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:45:08.3839525Z #define __CHAR_BIT__ 8 2025-05-07T19:45:08.3839836Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:08.3840181Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:08.3840555Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:08.3840901Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:08.3841260Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:08.3841595Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:08.3841964Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:08.3842331Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:08.3842677Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:08.3843046Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:08.3843386Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:45:08.3843730Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:45:08.3844062Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:45:08.3844436Z #define __DBL_DIG__ 15 2025-05-07T19:45:08.3844724Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:45:08.3845097Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:45:08.3845390Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:45:08.3845711Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.3846034Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:45:08.3846330Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:45:08.3846654Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:45:08.3846951Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:45:08.3847308Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:45:08.3847616Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:45:08.3847946Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:45:08.3848293Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:45:08.3848762Z #define __DEPRECATED 1 2025-05-07T19:45:08.3851451Z #define __ELF__ 1 2025-05-07T19:45:08.3851716Z #define __EXCEPTIONS 1 2025-05-07T19:45:08.3852000Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:45:08.3852275Z #define __FLOAT128__ 1 2025-05-07T19:45:08.3852567Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:45:08.3852883Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:45:08.3853238Z #define __FLT16_DIG__ 3 2025-05-07T19:45:08.3853495Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:45:08.3853829Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:45:08.3854126Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:45:08.3854438Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.3854725Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:45:08.3855023Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:45:08.3855319Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:45:08.3855588Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:45:08.3855902Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:45:08.3856191Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:45:08.3856503Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:45:08.3856801Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:45:08.3857119Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:45:08.3857419Z #define __FLT_DIG__ 6 2025-05-07T19:45:08.3857695Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:45:08.3857994Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:45:08.3858295Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:45:08.3858594Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.3858949Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:45:08.3859242Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:45:08.3859509Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:45:08.3859806Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:45:08.3860099Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:45:08.3860408Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:45:08.3860685Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:45:08.3861001Z #define __FLT_RADIX__ 2 2025-05-07T19:45:08.3861254Z #define __FXSR__ 1 2025-05-07T19:45:08.3861528Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:45:08.3861854Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:45:08.3862176Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:45:08.3862533Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:45:08.3862859Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:45:08.3863198Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:45:08.3863500Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:45:08.3863839Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:45:08.3864150Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:45:08.3864494Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:45:08.3864842Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:45:08.3865167Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:45:08.3865516Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:45:08.3865831Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:45:08.3866199Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:45:08.3866541Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:45:08.3866902Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:45:08.3867226Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:45:08.3867560Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:45:08.3867887Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:45:08.3868155Z #define __GNUC_MINOR__ 2 2025-05-07T19:45:08.3868434Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:45:08.3868734Z #define __GNUC__ 4 2025-05-07T19:45:08.3868996Z #define __GNUG__ 4 2025-05-07T19:45:08.3869236Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:45:08.3869554Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:45:08.3869845Z #define __GXX_RTTI 1 2025-05-07T19:45:08.3870126Z #define __GXX_WEAK__ 1 2025-05-07T19:45:08.3870382Z #define __INT16_C_SUFFIX__ 2025-05-07T19:45:08.3870673Z #define __INT16_FMTd__ "hd" 2025-05-07T19:45:08.3870963Z #define __INT16_FMTi__ "hi" 2025-05-07T19:45:08.3871233Z #define __INT16_MAX__ 32767 2025-05-07T19:45:08.3871621Z #define __INT16_TYPE__ short 2025-05-07T19:45:08.3871898Z #define __INT32_C_SUFFIX__ 2025-05-07T19:45:08.3872195Z #define __INT32_FMTd__ "d" 2025-05-07T19:45:08.3872462Z #define __INT32_FMTi__ "i" 2025-05-07T19:45:08.3872862Z #define __INT32_MAX__ 2147483647 2025-05-07T19:45:08.3873326Z #define __INT32_TYPE__ int 2025-05-07T19:45:08.3873642Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:45:08.3874001Z #define __INT64_FMTd__ "ld" 2025-05-07T19:45:08.3874320Z #define __INT64_FMTi__ "li" 2025-05-07T19:45:08.3874653Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:45:08.3874991Z #define __INT64_TYPE__ long int 2025-05-07T19:45:08.3875325Z #define __INT8_C_SUFFIX__ 2025-05-07T19:45:08.3875610Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:45:08.3875924Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:45:08.3876213Z #define __INT8_MAX__ 127 2025-05-07T19:45:08.3876527Z #define __INT8_TYPE__ signed char 2025-05-07T19:45:08.3876830Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:45:08.3877163Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:45:08.3877448Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:45:08.3877771Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:45:08.3878137Z #define __INTMAX_TYPE__ long int 2025-05-07T19:45:08.3878428Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:45:08.3878742Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:45:08.3879026Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:45:08.3879353Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:45:08.3879683Z #define __INTPTR_TYPE__ long int 2025-05-07T19:45:08.3880085Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:45:08.3880376Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:45:08.3880704Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:45:08.3881009Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:45:08.3881345Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:45:08.3881684Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:45:08.3881980Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:45:08.3882310Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:45:08.3882616Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:45:08.3882959Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:45:08.3883251Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:45:08.3883575Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:45:08.3883876Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:45:08.3884227Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:08.3884576Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:45:08.3884924Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:45:08.3885356Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:45:08.3885640Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:45:08.3885951Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:45:08.3886231Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:45:08.3886558Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:45:08.3886831Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:45:08.3887146Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:45:08.3887427Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:45:08.3887750Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:45:08.3888038Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:45:08.3888343Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:45:08.3888646Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:45:08.3888928Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:45:08.3889249Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:45:08.3889524Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:45:08.3889832Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:45:08.3890118Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:45:08.3890452Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:45:08.3890787Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:45:08.3891112Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:45:08.3891391Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:45:08.3891698Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:45:08.3892006Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:45:08.3892290Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:45:08.3892715Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:45:08.3892988Z #define __INT_MAX__ 2147483647 2025-05-07T19:45:08.3893287Z #define __INT_WIDTH__ 32 2025-05-07T19:45:08.3893549Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:45:08.3893904Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:45:08.3894253Z #define __LDBL_DIG__ 18 2025-05-07T19:45:08.3894564Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:45:08.3894922Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:45:08.3895202Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:45:08.3895509Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:45:08.3895792Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:45:08.3896097Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:45:08.3896380Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:45:08.3896711Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:45:08.3897046Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:45:08.3897387Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:45:08.3897704Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:45:08.3898058Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:45:08.3898349Z #define __LLONG_WIDTH__ 64 2025-05-07T19:45:08.3898632Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:45:08.3898987Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:45:08.3899293Z #define __LONG_WIDTH__ 64 2025-05-07T19:45:08.3899572Z #define __LP64__ 1 2025-05-07T19:45:08.3899797Z #define __MMX__ 1 2025-05-07T19:45:08.3900055Z #define __NO_INLINE__ 1 2025-05-07T19:45:08.3900373Z #define __NO_MATH_INLINES 1 2025-05-07T19:45:08.3900678Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:45:08.3900993Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:45:08.3901374Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:45:08.3901736Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:45:08.3902072Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:45:08.3902436Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:45:08.3902752Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:45:08.3903084Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:45:08.3903395Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:45:08.3903711Z #define __PIC__ 2 2025-05-07T19:45:08.3903946Z #define __PIE__ 2 2025-05-07T19:45:08.3904224Z #define __POINTER_WIDTH__ 64 2025-05-07T19:45:08.3904517Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:45:08.3904858Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:45:08.3905165Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:45:08.3905468Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:45:08.3905809Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:45:08.3906108Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:45:08.3906414Z #define __REGISTER_PREFIX__ 2025-05-07T19:45:08.3906692Z #define __SCHAR_MAX__ 127 2025-05-07T19:45:08.3906981Z #define __SEG_FS 1 2025-05-07T19:45:08.3907219Z #define __SEG_GS 1 2025-05-07T19:45:08.3907497Z #define __SHRT_MAX__ 32767 2025-05-07T19:45:08.3907770Z #define __SHRT_WIDTH__ 16 2025-05-07T19:45:08.3908061Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:45:08.3908386Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:45:08.3908660Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:45:08.3908954Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:45:08.3909227Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:45:08.3909520Z #define __SIZEOF_INT128__ 16 2025-05-07T19:45:08.3909784Z #define __SIZEOF_INT__ 4 2025-05-07T19:45:08.3910070Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:45:08.3910359Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:45:08.3910650Z #define __SIZEOF_LONG__ 8 2025-05-07T19:45:08.3910908Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:45:08.3911206Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:45:08.3911511Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:45:08.3911774Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:45:08.3912079Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:45:08.3912353Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:45:08.3912723Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:45:08.3913308Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:45:08.3913615Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:45:08.3913961Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:45:08.3914287Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.3914633Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:45:08.3914994Z #define __SIZE_WIDTH__ 64 2025-05-07T19:45:08.3915267Z #define __SSE2_MATH__ 1 2025-05-07T19:45:08.3915553Z #define __SSE2__ 1 2025-05-07T19:45:08.3915830Z #define __SSE_MATH__ 1 2025-05-07T19:45:08.3916085Z #define __SSE__ 1 2025-05-07T19:45:08.3916402Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:45:08.3916754Z #define __STDCPP_THREADS__ 1 2025-05-07T19:45:08.3917075Z #define __STDC_HOSTED__ 1 2025-05-07T19:45:08.3917345Z #define __STDC_UTF_16__ 1 2025-05-07T19:45:08.3917642Z #define __STDC_UTF_32__ 1 2025-05-07T19:45:08.3917903Z #define __STDC__ 1 2025-05-07T19:45:08.3918183Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:45:08.3918465Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:45:08.3918775Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:45:08.3919086Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:45:08.3919370Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:45:08.3919672Z #define __UINT16_MAX__ 65535 2025-05-07T19:45:08.3919965Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:45:08.3920312Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:45:08.3920595Z #define __UINT32_FMTX__ "X" 2025-05-07T19:45:08.3920905Z #define __UINT32_FMTo__ "o" 2025-05-07T19:45:08.3921183Z #define __UINT32_FMTu__ "u" 2025-05-07T19:45:08.3921551Z #define __UINT32_FMTx__ "x" 2025-05-07T19:45:08.3921832Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:45:08.3922172Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:45:08.3922514Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:45:08.3922809Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:45:08.3923120Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:45:08.3923406Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:45:08.3923716Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:45:08.3924021Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.3924391Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:45:08.3924720Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:45:08.3925028Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:45:08.3925407Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:45:08.3925695Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:45:08.3925981Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:45:08.3926241Z #define __UINT8_MAX__ 255 2025-05-07T19:45:08.3926531Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:45:08.3926836Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:45:08.3927144Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:45:08.3927424Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:45:08.3927728Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:45:08.3928008Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:45:08.3928335Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.3928672Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:45:08.3929016Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:45:08.3929320Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:45:08.3929591Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:45:08.3929888Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:45:08.3930161Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:45:08.3930489Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.3930843Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:45:08.3931191Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:45:08.3931796Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:45:08.3932158Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:45:08.3932591Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:45:08.3932941Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:45:08.3933286Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:45:08.3933613Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:45:08.3933998Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:45:08.3934300Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:45:08.3934638Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:45:08.3935083Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:45:08.3935424Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:45:08.3935769Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:45:08.3936146Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:45:08.3936459Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:45:08.3936810Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:45:08.3937158Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:45:08.3937499Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.3937919Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:45:08.3938278Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:45:08.3938621Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:45:08.3938940Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:45:08.3939288Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:45:08.3939603Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:45:08.3939912Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:45:08.3940240Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:45:08.3940522Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:45:08.3940818Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:45:08.3941098Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:45:08.3941389Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:45:08.3941687Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:45:08.3942015Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:45:08.3942294Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:45:08.3942677Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:45:08.3942955Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:45:08.3943250Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:45:08.3943584Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:45:08.3943893Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:45:08.3944199Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:45:08.3944578Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:45:08.3944895Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:45:08.3945217Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:45:08.3945609Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:45:08.3945950Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:45:08.3946271Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:45:08.3946592Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:45:08.3946881Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:45:08.3947204Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:45:08.3947493Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:45:08.3947839Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:45:08.3948471Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:08.3949116Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:45:08.3949402Z #define __WCHAR_TYPE__ int 2025-05-07T19:45:08.3949695Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:45:08.3949990Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:45:08.3950277Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:45:08.3950596Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:45:08.3950865Z #define __WINT_WIDTH__ 32 2025-05-07T19:45:08.3951145Z #define __amd64 1 2025-05-07T19:45:08.3951369Z #define __amd64__ 1 2025-05-07T19:45:08.3951625Z #define __clang__ 1 2025-05-07T19:45:08.3951882Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:45:08.3952217Z #define __clang_major__ 16 2025-05-07T19:45:08.3952473Z #define __clang_minor__ 0 2025-05-07T19:45:08.3952843Z #define __clang_patchlevel__ 6 2025-05-07T19:45:08.3953680Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:45:08.3954385Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:45:08.3954777Z #define __code_model_small__ 1 2025-05-07T19:45:08.3955076Z #define __cplusplus 201703L 2025-05-07T19:45:08.3955408Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:45:08.3955742Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:45:08.3956192Z #define __cpp_alias_templates 200704L 2025-05-07T19:45:08.3956519Z #define __cpp_aligned_new 201606L 2025-05-07T19:45:08.3956863Z #define __cpp_attributes 200809L 2025-05-07T19:45:08.3957176Z #define __cpp_binary_literals 201304L 2025-05-07T19:45:08.3957535Z #define __cpp_capture_star_this 201603L 2025-05-07T19:45:08.3957895Z #define __cpp_constexpr 201603L 2025-05-07T19:45:08.3958215Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:45:08.3958585Z #define __cpp_decltype 200707L 2025-05-07T19:45:08.3958895Z #define __cpp_decltype_auto 201304L 2025-05-07T19:45:08.3959250Z #define __cpp_deduction_guides 201703L 2025-05-07T19:45:08.3959603Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:45:08.3959990Z #define __cpp_digit_separators 201309L 2025-05-07T19:45:08.3960335Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:45:08.3960709Z #define __cpp_exceptions 199711L 2025-05-07T19:45:08.3961055Z #define __cpp_fold_expressions 201603L 2025-05-07T19:45:08.3961396Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:45:08.3961778Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:45:08.3962136Z #define __cpp_hex_float 201603L 2025-05-07T19:45:08.3962480Z #define __cpp_if_constexpr 201606L 2025-05-07T19:45:08.3962822Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:45:08.3963221Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:45:08.3963580Z #define __cpp_init_captures 201304L 2025-05-07T19:45:08.3963936Z #define __cpp_initializer_lists 200806L 2025-05-07T19:45:08.3964370Z #define __cpp_inline_variables 201606L 2025-05-07T19:45:08.3964700Z #define __cpp_lambdas 200907L 2025-05-07T19:45:08.3965053Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:45:08.3965515Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:45:08.3965908Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:45:08.3966284Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:45:08.3966654Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:45:08.3967017Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:45:08.3967411Z #define __cpp_nsdmi 200809L 2025-05-07T19:45:08.3967693Z #define __cpp_range_based_for 201603L 2025-05-07T19:45:08.3968037Z #define __cpp_raw_strings 200710L 2025-05-07T19:45:08.3968364Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:45:08.3968683Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:45:08.3969020Z #define __cpp_rtti 199711L 2025-05-07T19:45:08.3969298Z #define __cpp_rvalue_references 200610L 2025-05-07T19:45:08.3969653Z #define __cpp_static_assert 201411L 2025-05-07T19:45:08.3969968Z #define __cpp_static_call_operator 202207L 2025-05-07T19:45:08.3970333Z #define __cpp_structured_bindings 201606L 2025-05-07T19:45:08.3970665Z #define __cpp_template_auto 201606L 2025-05-07T19:45:08.3971016Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:45:08.3971388Z #define __cpp_unicode_characters 200704L 2025-05-07T19:45:08.3971721Z #define __cpp_unicode_literals 200710L 2025-05-07T19:45:08.3972087Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:45:08.3972426Z #define __cpp_variable_templates 201304L 2025-05-07T19:45:08.3972784Z #define __cpp_variadic_templates 200704L 2025-05-07T19:45:08.3973111Z #define __cpp_variadic_using 201611L 2025-05-07T19:45:08.3973440Z #define __gnu_linux__ 1 2025-05-07T19:45:08.3973695Z #define __k8 1 2025-05-07T19:45:08.3973960Z #define __k8__ 1 2025-05-07T19:45:08.3974190Z #define __linux 1 2025-05-07T19:45:08.3974451Z #define __linux__ 1 2025-05-07T19:45:08.3974698Z #define __llvm__ 1 2025-05-07T19:45:08.3974911Z #define __pic__ 2 2025-05-07T19:45:08.3975164Z #define __pie__ 2 2025-05-07T19:45:08.3993650Z #define __private_extern__ extern 2025-05-07T19:45:08.3994164Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:45:08.3994563Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:45:08.3994935Z #define __tune_k8__ 1 2025-05-07T19:45:08.3995197Z #define __unix 1 2025-05-07T19:45:08.3995475Z #define __unix__ 1 2025-05-07T19:45:08.3995884Z #define __x86_64 1 2025-05-07T19:45:08.3996167Z #define __x86_64__ 1 2025-05-07T19:45:08.3996424Z #define linux 1 2025-05-07T19:45:08.3996697Z #define unix 1 2025-05-07T19:45:08.3996847Z 2025-05-07T19:45:08.4420687Z 2025-05-07T19:45:08.4421546Z + conda run -n build_binary c++ --version 2025-05-07T19:45:08.4421854Z 2025-05-07T19:45:10.2538687Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:45:10.2539387Z Target: x86_64-conda-linux-gnu 2025-05-07T19:45:10.2539784Z Thread model: posix 2025-05-07T19:45:10.2540160Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:45:10.2540835Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:45:10.2541325Z 2025-05-07T19:45:10.3153890Z 2025-05-07T19:45:10.3154555Z [INFO] Printing the default version of the C standard used by the compiler ... 2025-05-07T19:45:10.3155265Z + conda run -n build_binary cc -dM -E - < /dev/null | grep __STDC_VERSION__ 2025-05-07T19:45:10.3155666Z 2025-05-07T19:45:12.2003304Z #define __STDC_VERSION__ 201710L 2025-05-07T19:45:12.2007414Z 2025-05-07T19:45:12.2008224Z [INFO] Printing the default version of the C++ standard used by the compiler ... 2025-05-07T19:45:12.2008906Z + conda run -n build_binary c++ -dM -E -x c++ - < /dev/null | grep __cplusplus 2025-05-07T19:45:12.2009287Z 2025-05-07T19:45:14.0693429Z #define __cplusplus 201703L 2025-05-07T19:45:14.0696224Z 2025-05-07T19:45:14.0696693Z [INSTALL] Successfully installed C/C++ compilers 2025-05-07T19:45:14.0762527Z ##[group]Run . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:14.0762944Z . $PRELUDE; install_build_tools $BUILD_ENV 2025-05-07T19:45:14.0763682Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:14.0764011Z env: 2025-05-07T19:45:14.0764224Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:14.0764527Z BUILD_ENV: build_binary 2025-05-07T19:45:14.0764782Z BUILD_TARGET: default 2025-05-07T19:45:14.0765011Z BUILD_VARIANT: cuda 2025-05-07T19:45:14.0765241Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:45:14.0765503Z ##[endgroup] 2025-05-07T19:45:14.5106186Z ################################################################################ 2025-05-07T19:45:14.5106616Z # Install Build Tools 2025-05-07T19:45:14.5107036Z # 2025-05-07T19:45:14.5116662Z # [2025-05-07T19:45:14.511Z] + install_build_tools build_binary 2025-05-07T19:45:14.5117081Z ################################################################################ 2025-05-07T19:45:14.5117443Z 2025-05-07T19:45:14.5133902Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:14.5966976Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:14.5976882Z [INSTALL] Installing build tools ... 2025-05-07T19:45:14.5999756Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y auditwheel bazel cmake>=3.30 hypothesis jinja2 make ncurses ninja openblas patchelf rhash scikit-build wheel pyyaml 2025-05-07T19:45:15.3171578Z Channels: 2025-05-07T19:45:15.3172279Z - conda-forge 2025-05-07T19:45:15.3172931Z Platform: linux-64 2025-05-07T19:45:18.4734304Z Collecting package metadata (repodata.json): - \ | / done 2025-05-07T19:45:22.1955543Z Solving environment: \ | / - done 2025-05-07T19:45:22.2558323Z 2025-05-07T19:45:22.2558893Z ## Package Plan ## 2025-05-07T19:45:22.2559507Z 2025-05-07T19:45:22.2560114Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:22.2561050Z 2025-05-07T19:45:22.2561410Z added / updated specs: 2025-05-07T19:45:22.2562128Z - auditwheel 2025-05-07T19:45:22.2562762Z - bazel 2025-05-07T19:45:22.2563361Z - cmake[version='>=3.30'] 2025-05-07T19:45:22.2564134Z - hypothesis 2025-05-07T19:45:22.2564733Z - jinja2 2025-05-07T19:45:22.2565455Z - make 2025-05-07T19:45:22.2565671Z - ncurses 2025-05-07T19:45:22.2565916Z - ninja 2025-05-07T19:45:22.2566461Z - openblas 2025-05-07T19:45:22.2566722Z - patchelf 2025-05-07T19:45:22.2566952Z - pyyaml 2025-05-07T19:45:22.2567204Z - rhash 2025-05-07T19:45:22.2567450Z - scikit-build 2025-05-07T19:45:22.2567690Z - wheel 2025-05-07T19:45:22.2567820Z 2025-05-07T19:45:22.2567824Z 2025-05-07T19:45:22.2568097Z The following packages will be downloaded: 2025-05-07T19:45:22.2568320Z 2025-05-07T19:45:22.2568440Z package | build 2025-05-07T19:45:22.2568799Z ---------------------------|----------------- 2025-05-07T19:45:22.2569197Z alsa-lib-1.2.14 | hb9d3cd8_0 553 KB conda-forge 2025-05-07T19:45:22.2569667Z attrs-25.3.0 | pyh71513ae_0 56 KB conda-forge 2025-05-07T19:45:22.2570138Z auditwheel-6.2.0 | pyha804496_1 40 KB conda-forge 2025-05-07T19:45:22.2570566Z bazel-7.5.0 | h96810dc_2 47.4 MB conda-forge 2025-05-07T19:45:22.2571004Z c-ares-1.34.5 | hb9d3cd8_0 202 KB conda-forge 2025-05-07T19:45:22.2571413Z cairo-1.18.0 | hbb29018_2 961 KB conda-forge 2025-05-07T19:45:22.2571848Z click-8.1.8 | pyh707e725_0 83 KB conda-forge 2025-05-07T19:45:22.2572257Z cmake-4.0.2 | h74e3db0_0 19.4 MB conda-forge 2025-05-07T19:45:22.2572705Z distro-1.9.0 | pyhd8ed1ab_1 41 KB conda-forge 2025-05-07T19:45:22.2573378Z exceptiongroup-1.2.2 | pyhd8ed1ab_1 20 KB conda-forge 2025-05-07T19:45:22.2573904Z font-ttf-dejavu-sans-mono-2.37| hab24e00_0 388 KB conda-forge 2025-05-07T19:45:22.2574472Z font-ttf-inconsolata-3.000 | h77eed37_0 94 KB conda-forge 2025-05-07T19:45:22.2575001Z font-ttf-source-code-pro-2.038| h77eed37_0 684 KB conda-forge 2025-05-07T19:45:22.2575539Z font-ttf-ubuntu-0.83 | h77eed37_3 1.5 MB conda-forge 2025-05-07T19:45:22.2576029Z fontconfig-2.15.0 | h7e30c49_1 259 KB conda-forge 2025-05-07T19:45:22.2576506Z fonts-conda-ecosystem-1 | 0 4 KB conda-forge 2025-05-07T19:45:22.2577017Z fonts-conda-forge-1 | 0 4 KB conda-forge 2025-05-07T19:45:22.2577466Z freetype-2.13.3 | ha770c72_1 168 KB conda-forge 2025-05-07T19:45:22.2577907Z giflib-5.2.2 | hd590300_0 75 KB conda-forge 2025-05-07T19:45:22.2578335Z graphite2-1.3.13 | h59595ed_1003 95 KB conda-forge 2025-05-07T19:45:22.2578791Z harfbuzz-9.0.0 | hfac3d4d_0 1.5 MB conda-forge 2025-05-07T19:45:22.2579269Z hypothesis-6.131.14 | pyha770c72_0 348 KB conda-forge 2025-05-07T19:45:22.2579695Z ijar-7.5.0 | h5888daf_0 114 KB conda-forge 2025-05-07T19:45:22.2580135Z jinja2-3.1.6 | pyhd8ed1ab_0 110 KB conda-forge 2025-05-07T19:45:22.2580562Z keyutils-1.6.1 | h166bdaf_0 115 KB conda-forge 2025-05-07T19:45:22.2580994Z krb5-1.21.3 | h659f571_0 1.3 MB conda-forge 2025-05-07T19:45:22.2581388Z lcms2-2.17 | h717163a_0 242 KB conda-forge 2025-05-07T19:45:22.2581803Z lerc-4.0.0 | h0aef613_1 258 KB conda-forge 2025-05-07T19:45:22.2582269Z libabseil-20250127.1 | cxx17_hbbce691_0 1.3 MB conda-forge 2025-05-07T19:45:22.2582731Z libcups-2.3.3 | h4637d8d_4 4.3 MB conda-forge 2025-05-07T19:45:22.2583179Z libcurl-8.13.0 | h332b0f4_0 428 KB conda-forge 2025-05-07T19:45:22.2583612Z libdeflate-1.23 | h86f0d12_0 71 KB conda-forge 2025-05-07T19:45:22.2584105Z libedit-3.1.20250104 | pl5321h7949ede_0 132 KB conda-forge 2025-05-07T19:45:22.2584653Z libev-4.33 | hd590300_2 110 KB conda-forge 2025-05-07T19:45:22.2585078Z libexpat-2.7.0 | h5888daf_0 73 KB conda-forge 2025-05-07T19:45:22.2585602Z libfreetype-2.13.3 | ha770c72_1 8 KB conda-forge 2025-05-07T19:45:22.2586087Z libfreetype6-2.13.3 | h48d6fc4_1 371 KB conda-forge 2025-05-07T19:45:22.2586545Z libgfortran-15.1.0 | h69a702a_2 34 KB conda-forge 2025-05-07T19:45:22.2587037Z libgfortran5-15.1.0 | hcea5267_2 1.5 MB conda-forge 2025-05-07T19:45:22.2587507Z libglib-2.84.0 | h2ff4ddf_0 3.8 MB conda-forge 2025-05-07T19:45:22.2587937Z libgrpc-1.71.0 | h8e591d7_1 7.6 MB conda-forge 2025-05-07T19:45:22.2588414Z libjpeg-turbo-3.1.0 | hb9d3cd8_0 614 KB conda-forge 2025-05-07T19:45:22.2588861Z liblzma-5.8.1 | hb9d3cd8_1 110 KB conda-forge 2025-05-07T19:45:22.2589334Z liblzma-devel-5.8.1 | hb9d3cd8_1 431 KB conda-forge 2025-05-07T19:45:22.2589791Z libnghttp2-1.64.0 | h161d5f1_0 632 KB conda-forge 2025-05-07T19:45:22.2590246Z libnsl-2.0.1 | hd590300_0 33 KB conda-forge 2025-05-07T19:45:22.2590731Z libopenblas-0.3.29 |pthreads_h94d23a6_0 5.6 MB conda-forge 2025-05-07T19:45:22.2591185Z libpng-1.6.47 | h943b412_0 282 KB conda-forge 2025-05-07T19:45:22.2591763Z libprotobuf-5.29.3 | h501fc15_1 3.2 MB conda-forge 2025-05-07T19:45:22.2592220Z libre2-11-2024.07.02 | hba17884_3 205 KB conda-forge 2025-05-07T19:45:22.2592819Z libsqlite-3.49.2 | hee588c1_0 895 KB conda-forge 2025-05-07T19:45:22.2593483Z libssh2-1.11.1 | hcf80075_0 298 KB conda-forge 2025-05-07T19:45:22.2593939Z libtiff-4.7.0 | hd9ff511_4 419 KB conda-forge 2025-05-07T19:45:22.2594422Z libuuid-2.38.1 | h0b41bf4_0 33 KB conda-forge 2025-05-07T19:45:22.2594868Z libuv-1.50.0 | hb9d3cd8_0 870 KB conda-forge 2025-05-07T19:45:22.2595361Z libwebp-base-1.5.0 | h851e524_0 420 KB conda-forge 2025-05-07T19:45:22.2595837Z libxcb-1.17.0 | h8a09558_0 387 KB conda-forge 2025-05-07T19:45:22.2596317Z libzlib-1.3.1 | hb9d3cd8_2 60 KB conda-forge 2025-05-07T19:45:22.2596773Z make-4.4.1 | hb9d3cd8_2 501 KB conda-forge 2025-05-07T19:45:22.2597234Z markupsafe-3.0.2 | py310h89163eb_1 23 KB conda-forge 2025-05-07T19:45:22.2597728Z ncurses-6.5 | h2d0b736_3 871 KB conda-forge 2025-05-07T19:45:22.2598168Z ninja-1.12.1 | hff21bea_1 158 KB conda-forge 2025-05-07T19:45:22.2598673Z openblas-0.3.29 |pthreads_h6ec200e_0 5.8 MB conda-forge 2025-05-07T19:45:22.2599154Z openjdk-23.0.1 | h4c11d01_0 181.3 MB conda-forge 2025-05-07T19:45:22.2599643Z packaging-25.0 | pyh29332c3_1 61 KB conda-forge 2025-05-07T19:45:22.2600136Z patchelf-0.18.0 | h3f2d84a_2 133 KB conda-forge 2025-05-07T19:45:22.2600586Z pcre2-10.44 | hc749103_2 934 KB conda-forge 2025-05-07T19:45:22.2601054Z pixman-0.46.0 | h29eaf8c_0 389 KB conda-forge 2025-05-07T19:45:22.2601529Z pthread-stubs-0.4 | hb9d3cd8_1002 8 KB conda-forge 2025-05-07T19:45:22.2602050Z pyelftools-0.32 | pyh707e725_1 146 KB conda-forge 2025-05-07T19:45:22.2602556Z python-3.10.17 |hd6af730_0_cpython 23.9 MB conda-forge 2025-05-07T19:45:22.2603090Z pyyaml-6.0.2 | py310h89163eb_2 178 KB conda-forge 2025-05-07T19:45:22.2603562Z re2-2024.07.02 | h9925aae_3 26 KB conda-forge 2025-05-07T19:45:22.2604025Z rhash-1.4.5 | hb9d3cd8_0 183 KB conda-forge 2025-05-07T19:45:22.2604502Z scikit-build-0.18.1 | pyhae55e72_2 114 KB conda-forge 2025-05-07T19:45:22.2605011Z singlejar-7.5.0 | h0e684df_1 122 KB conda-forge 2025-05-07T19:45:22.2605604Z sortedcontainers-2.4.0 | pyhd8ed1ab_1 28 KB conda-forge 2025-05-07T19:45:22.2606066Z sqlite-3.49.2 | h9eae976_0 840 KB conda-forge 2025-05-07T19:45:22.2606461Z tk-8.6.13 |noxft_h4845f30_101 3.2 MB conda-forge 2025-05-07T19:45:22.2606870Z tomli-2.2.1 | pyhd8ed1ab_1 19 KB conda-forge 2025-05-07T19:45:22.2607304Z wheel-0.45.1 | pyhd8ed1ab_1 61 KB conda-forge 2025-05-07T19:45:22.2607730Z xorg-libice-1.1.2 | hb9d3cd8_0 57 KB conda-forge 2025-05-07T19:45:22.2608173Z xorg-libsm-1.2.6 | he73a12e_0 27 KB conda-forge 2025-05-07T19:45:22.2608598Z xorg-libx11-1.8.12 | h4f16b4b_0 816 KB conda-forge 2025-05-07T19:45:22.2609053Z xorg-libxau-1.0.12 | hb9d3cd8_0 14 KB conda-forge 2025-05-07T19:45:22.2609514Z xorg-libxdmcp-1.1.5 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:22.2610033Z xorg-libxext-1.3.6 | hb9d3cd8_0 49 KB conda-forge 2025-05-07T19:45:22.2610531Z xorg-libxfixes-6.0.1 | hb9d3cd8_0 19 KB conda-forge 2025-05-07T19:45:22.2610977Z xorg-libxi-1.8.2 | hb9d3cd8_0 46 KB conda-forge 2025-05-07T19:45:22.2611434Z xorg-libxrandr-1.5.4 | hb9d3cd8_0 29 KB conda-forge 2025-05-07T19:45:22.2611896Z xorg-libxrender-0.9.12 | hb9d3cd8_0 32 KB conda-forge 2025-05-07T19:45:22.2612363Z xorg-libxt-1.3.1 | hb9d3cd8_0 371 KB conda-forge 2025-05-07T19:45:22.2612819Z xorg-libxtst-1.2.5 | hb9d3cd8_3 32 KB conda-forge 2025-05-07T19:45:22.2613241Z xz-5.8.1 | hbcc6ac9_1 23 KB conda-forge 2025-05-07T19:45:22.2613693Z xz-gpl-tools-5.8.1 | hbcc6ac9_1 33 KB conda-forge 2025-05-07T19:45:22.2614137Z xz-tools-5.8.1 | hb9d3cd8_1 94 KB conda-forge 2025-05-07T19:45:22.2614576Z yaml-0.2.5 | h7f98852_2 87 KB conda-forge 2025-05-07T19:45:22.2614972Z zlib-1.3.1 | hb9d3cd8_2 90 KB conda-forge 2025-05-07T19:45:22.2615401Z zstd-1.5.7 | hb8e6e7a_2 554 KB conda-forge 2025-05-07T19:45:22.2615819Z ------------------------------------------------------------ 2025-05-07T19:45:22.2616186Z Total: 331.2 MB 2025-05-07T19:45:22.2616426Z 2025-05-07T19:45:22.2616561Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:22.2616784Z 2025-05-07T19:45:22.2616987Z alsa-lib conda-forge/linux-64::alsa-lib-1.2.14-hb9d3cd8_0 2025-05-07T19:45:22.2617455Z attrs conda-forge/noarch::attrs-25.3.0-pyh71513ae_0 2025-05-07T19:45:22.2617936Z auditwheel conda-forge/noarch::auditwheel-6.2.0-pyha804496_1 2025-05-07T19:45:22.2618390Z bazel conda-forge/linux-64::bazel-7.5.0-h96810dc_2 2025-05-07T19:45:22.2618813Z c-ares conda-forge/linux-64::c-ares-1.34.5-hb9d3cd8_0 2025-05-07T19:45:22.2619223Z cairo conda-forge/linux-64::cairo-1.18.0-hbb29018_2 2025-05-07T19:45:22.2619641Z click conda-forge/noarch::click-8.1.8-pyh707e725_0 2025-05-07T19:45:22.2620051Z cmake conda-forge/linux-64::cmake-4.0.2-h74e3db0_0 2025-05-07T19:45:22.2620518Z distro conda-forge/noarch::distro-1.9.0-pyhd8ed1ab_1 2025-05-07T19:45:22.2621018Z exceptiongroup conda-forge/noarch::exceptiongroup-1.2.2-pyhd8ed1ab_1 2025-05-07T19:45:22.2621605Z font-ttf-dejavu-s~ conda-forge/noarch::font-ttf-dejavu-sans-mono-2.37-hab24e00_0 2025-05-07T19:45:22.2622226Z font-ttf-inconsol~ conda-forge/noarch::font-ttf-inconsolata-3.000-h77eed37_0 2025-05-07T19:45:22.2622844Z font-ttf-source-c~ conda-forge/noarch::font-ttf-source-code-pro-2.038-h77eed37_0 2025-05-07T19:45:22.2623416Z font-ttf-ubuntu conda-forge/noarch::font-ttf-ubuntu-0.83-h77eed37_3 2025-05-07T19:45:22.2623942Z fontconfig conda-forge/linux-64::fontconfig-2.15.0-h7e30c49_1 2025-05-07T19:45:22.2624431Z fonts-conda-ecosy~ conda-forge/noarch::fonts-conda-ecosystem-1-0 2025-05-07T19:45:22.2624930Z fonts-conda-forge conda-forge/noarch::fonts-conda-forge-1-0 2025-05-07T19:45:22.2625403Z freetype conda-forge/linux-64::freetype-2.13.3-ha770c72_1 2025-05-07T19:45:22.2625839Z giflib conda-forge/linux-64::giflib-5.2.2-hd590300_0 2025-05-07T19:45:22.2626322Z graphite2 conda-forge/linux-64::graphite2-1.3.13-h59595ed_1003 2025-05-07T19:45:22.2626801Z harfbuzz conda-forge/linux-64::harfbuzz-9.0.0-hfac3d4d_0 2025-05-07T19:45:22.2627311Z hypothesis conda-forge/noarch::hypothesis-6.131.14-pyha770c72_0 2025-05-07T19:45:22.2627795Z ijar conda-forge/linux-64::ijar-7.5.0-h5888daf_0 2025-05-07T19:45:22.2628214Z jinja2 conda-forge/noarch::jinja2-3.1.6-pyhd8ed1ab_0 2025-05-07T19:45:22.2628784Z keyutils conda-forge/linux-64::keyutils-1.6.1-h166bdaf_0 2025-05-07T19:45:22.2629220Z krb5 conda-forge/linux-64::krb5-1.21.3-h659f571_0 2025-05-07T19:45:22.2629659Z lcms2 conda-forge/linux-64::lcms2-2.17-h717163a_0 2025-05-07T19:45:22.2630246Z lerc conda-forge/linux-64::lerc-4.0.0-h0aef613_1 2025-05-07T19:45:22.2630781Z libabseil conda-forge/linux-64::libabseil-20250127.1-cxx17_hbbce691_0 2025-05-07T19:45:22.2631779Z libcups conda-forge/linux-64::libcups-2.3.3-h4637d8d_4 2025-05-07T19:45:22.2632382Z libcurl conda-forge/linux-64::libcurl-8.13.0-h332b0f4_0 2025-05-07T19:45:22.2633026Z libdeflate conda-forge/linux-64::libdeflate-1.23-h86f0d12_0 2025-05-07T19:45:22.2633579Z libedit conda-forge/linux-64::libedit-3.1.20250104-pl5321h7949ede_0 2025-05-07T19:45:22.2634125Z libev conda-forge/linux-64::libev-4.33-hd590300_2 2025-05-07T19:45:22.2634629Z libexpat conda-forge/linux-64::libexpat-2.7.0-h5888daf_0 2025-05-07T19:45:22.2635149Z libfreetype conda-forge/linux-64::libfreetype-2.13.3-ha770c72_1 2025-05-07T19:45:22.2635734Z libfreetype6 conda-forge/linux-64::libfreetype6-2.13.3-h48d6fc4_1 2025-05-07T19:45:22.2636285Z libgfortran conda-forge/linux-64::libgfortran-15.1.0-h69a702a_2 2025-05-07T19:45:22.2636854Z libgfortran5 conda-forge/linux-64::libgfortran5-15.1.0-hcea5267_2 2025-05-07T19:45:22.2637400Z libglib conda-forge/linux-64::libglib-2.84.0-h2ff4ddf_0 2025-05-07T19:45:22.2637880Z libgrpc conda-forge/linux-64::libgrpc-1.71.0-h8e591d7_1 2025-05-07T19:45:22.2638428Z libjpeg-turbo conda-forge/linux-64::libjpeg-turbo-3.1.0-hb9d3cd8_0 2025-05-07T19:45:22.2638949Z liblzma conda-forge/linux-64::liblzma-5.8.1-hb9d3cd8_1 2025-05-07T19:45:22.2639489Z liblzma-devel conda-forge/linux-64::liblzma-devel-5.8.1-hb9d3cd8_1 2025-05-07T19:45:22.2640068Z libnghttp2 conda-forge/linux-64::libnghttp2-1.64.0-h161d5f1_0 2025-05-07T19:45:22.2640558Z libnsl conda-forge/linux-64::libnsl-2.0.1-hd590300_0 2025-05-07T19:45:22.2641129Z libopenblas conda-forge/linux-64::libopenblas-0.3.29-pthreads_h94d23a6_0 2025-05-07T19:45:22.2641671Z libpng conda-forge/linux-64::libpng-1.6.47-h943b412_0 2025-05-07T19:45:22.2642197Z libprotobuf conda-forge/linux-64::libprotobuf-5.29.3-h501fc15_1 2025-05-07T19:45:22.2642883Z libre2-11 conda-forge/linux-64::libre2-11-2024.07.02-hba17884_3 2025-05-07T19:45:22.2643397Z libsqlite conda-forge/linux-64::libsqlite-3.49.2-hee588c1_0 2025-05-07T19:45:22.2643919Z libssh2 conda-forge/linux-64::libssh2-1.11.1-hcf80075_0 2025-05-07T19:45:22.2644392Z libtiff conda-forge/linux-64::libtiff-4.7.0-hd9ff511_4 2025-05-07T19:45:22.2645012Z libuv conda-forge/linux-64::libuv-1.50.0-hb9d3cd8_0 2025-05-07T19:45:22.2645631Z libwebp-base conda-forge/linux-64::libwebp-base-1.5.0-h851e524_0 2025-05-07T19:45:22.2646104Z libxcb conda-forge/linux-64::libxcb-1.17.0-h8a09558_0 2025-05-07T19:45:22.2646547Z make conda-forge/linux-64::make-4.4.1-hb9d3cd8_2 2025-05-07T19:45:22.2647009Z markupsafe conda-forge/linux-64::markupsafe-3.0.2-py310h89163eb_1 2025-05-07T19:45:22.2647509Z ninja conda-forge/linux-64::ninja-1.12.1-hff21bea_1 2025-05-07T19:45:22.2648016Z openblas conda-forge/linux-64::openblas-0.3.29-pthreads_h6ec200e_0 2025-05-07T19:45:22.2648507Z openjdk conda-forge/linux-64::openjdk-23.0.1-h4c11d01_0 2025-05-07T19:45:22.2649013Z packaging conda-forge/noarch::packaging-25.0-pyh29332c3_1 2025-05-07T19:45:22.2649487Z patchelf conda-forge/linux-64::patchelf-0.18.0-h3f2d84a_2 2025-05-07T19:45:22.2649953Z pcre2 conda-forge/linux-64::pcre2-10.44-hc749103_2 2025-05-07T19:45:22.2650480Z pixman conda-forge/linux-64::pixman-0.46.0-h29eaf8c_0 2025-05-07T19:45:22.2650996Z pthread-stubs conda-forge/linux-64::pthread-stubs-0.4-hb9d3cd8_1002 2025-05-07T19:45:22.2651538Z pyelftools conda-forge/noarch::pyelftools-0.32-pyh707e725_1 2025-05-07T19:45:22.2652015Z pyyaml conda-forge/linux-64::pyyaml-6.0.2-py310h89163eb_2 2025-05-07T19:45:22.2652474Z re2 conda-forge/linux-64::re2-2024.07.02-h9925aae_3 2025-05-07T19:45:22.2652891Z rhash conda-forge/linux-64::rhash-1.4.5-hb9d3cd8_0 2025-05-07T19:45:22.2653600Z scikit-build conda-forge/noarch::scikit-build-0.18.1-pyhae55e72_2 2025-05-07T19:45:22.2654160Z singlejar conda-forge/linux-64::singlejar-7.5.0-h0e684df_1 2025-05-07T19:45:22.2654909Z sortedcontainers conda-forge/noarch::sortedcontainers-2.4.0-pyhd8ed1ab_1 2025-05-07T19:45:22.2655487Z tomli conda-forge/noarch::tomli-2.2.1-pyhd8ed1ab_1 2025-05-07T19:45:22.2655989Z xorg-libice conda-forge/linux-64::xorg-libice-1.1.2-hb9d3cd8_0 2025-05-07T19:45:22.2656541Z xorg-libsm conda-forge/linux-64::xorg-libsm-1.2.6-he73a12e_0 2025-05-07T19:45:22.2657113Z xorg-libx11 conda-forge/linux-64::xorg-libx11-1.8.12-h4f16b4b_0 2025-05-07T19:45:22.2657641Z xorg-libxau conda-forge/linux-64::xorg-libxau-1.0.12-hb9d3cd8_0 2025-05-07T19:45:22.2658236Z xorg-libxdmcp conda-forge/linux-64::xorg-libxdmcp-1.1.5-hb9d3cd8_0 2025-05-07T19:45:22.2658805Z xorg-libxext conda-forge/linux-64::xorg-libxext-1.3.6-hb9d3cd8_0 2025-05-07T19:45:22.2659401Z xorg-libxfixes conda-forge/linux-64::xorg-libxfixes-6.0.1-hb9d3cd8_0 2025-05-07T19:45:22.2659976Z xorg-libxi conda-forge/linux-64::xorg-libxi-1.8.2-hb9d3cd8_0 2025-05-07T19:45:22.2660526Z xorg-libxrandr conda-forge/linux-64::xorg-libxrandr-1.5.4-hb9d3cd8_0 2025-05-07T19:45:22.2661147Z xorg-libxrender conda-forge/linux-64::xorg-libxrender-0.9.12-hb9d3cd8_0 2025-05-07T19:45:22.2661718Z xorg-libxt conda-forge/linux-64::xorg-libxt-1.3.1-hb9d3cd8_0 2025-05-07T19:45:22.2662281Z xorg-libxtst conda-forge/linux-64::xorg-libxtst-1.2.5-hb9d3cd8_3 2025-05-07T19:45:22.2662860Z xz-gpl-tools conda-forge/linux-64::xz-gpl-tools-5.8.1-hbcc6ac9_1 2025-05-07T19:45:22.2663385Z xz-tools conda-forge/linux-64::xz-tools-5.8.1-hb9d3cd8_1 2025-05-07T19:45:22.2663869Z yaml conda-forge/linux-64::yaml-0.2.5-h7f98852_2 2025-05-07T19:45:22.2664210Z 2025-05-07T19:45:22.2664345Z The following packages will be UPDATED: 2025-05-07T19:45:22.2664612Z 2025-05-07T19:45:22.2664920Z libuuid pkgs/main::libuuid-1.41.5-h5eee18b_0 --> conda-forge::libuuid-2.38.1-h0b41bf4_0 2025-05-07T19:45:22.2665533Z libzlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:22.2666096Z ncurses pkgs/main::ncurses-6.4-h6a678d5_0 --> conda-forge::ncurses-6.5-h2d0b736_3 2025-05-07T19:45:22.2666841Z python pkgs/main::python-3.10.16-he870216_1 --> conda-forge::python-3.10.17-hd6af730_0_cpython 2025-05-07T19:45:22.2667580Z sqlite pkgs/main::sqlite-3.45.3-h5eee18b_0 --> conda-forge::sqlite-3.49.2-h9eae976_0 2025-05-07T19:45:22.2668297Z wheel pkgs/main/linux-64::wheel-0.45.1-py31~ --> conda-forge/noarch::wheel-0.45.1-pyhd8ed1ab_1 2025-05-07T19:45:22.2668988Z xz pkgs/main::xz-5.6.4-h5eee18b_1 --> conda-forge::xz-5.8.1-hbcc6ac9_1 2025-05-07T19:45:22.2669489Z zlib 1.2.13-h4ab18f5_6 --> 1.3.1-hb9d3cd8_2 2025-05-07T19:45:22.2669933Z zstd 1.5.6-ha6fb4c9_0 --> 1.5.7-hb8e6e7a_2 2025-05-07T19:45:22.2670196Z 2025-05-07T19:45:22.2670464Z The following packages will be SUPERSEDED by a higher-priority channel: 2025-05-07T19:45:22.2670818Z 2025-05-07T19:45:22.2671074Z tk pkgs/main::tk-8.6.14-h39e8969_0 --> conda-forge::tk-8.6.13-noxft_h4845f30_101 2025-05-07T19:45:22.2671465Z 2025-05-07T19:45:22.2671492Z 2025-05-07T19:45:22.2671561Z 2025-05-07T19:45:22.2671713Z Downloading and Extracting Packages: ...working... 2025-05-07T19:45:22.2672124Z openjdk-23.0.1 | 181.3 MB | | 0% 2025-05-07T19:45:22.2672371Z 2025-05-07T19:45:22.2672781Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:22.2673043Z 2025-05-07T19:45:22.2673047Z 2025-05-07T19:45:22.2673275Z python-3.10.17 | 23.9 MB | | 0%  2025-05-07T19:45:22.2673536Z 2025-05-07T19:45:22.2673541Z 2025-05-07T19:45:22.2673545Z 2025-05-07T19:45:22.2684021Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:22.2684778Z 2025-05-07T19:45:22.2684788Z 2025-05-07T19:45:22.2684799Z 2025-05-07T19:45:22.2684809Z 2025-05-07T19:45:22.2700690Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:22.2701612Z 2025-05-07T19:45:22.2701625Z 2025-05-07T19:45:22.2701637Z 2025-05-07T19:45:22.2701647Z 2025-05-07T19:45:22.2701658Z 2025-05-07T19:45:22.2702406Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:22.2703253Z 2025-05-07T19:45:22.2703264Z 2025-05-07T19:45:22.2703274Z 2025-05-07T19:45:22.2703285Z 2025-05-07T19:45:22.2703295Z 2025-05-07T19:45:22.2703306Z 2025-05-07T19:45:22.2704058Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:22.2705063Z 2025-05-07T19:45:22.2705067Z 2025-05-07T19:45:22.2705077Z 2025-05-07T19:45:22.2705081Z 2025-05-07T19:45:22.2705085Z 2025-05-07T19:45:22.2705088Z 2025-05-07T19:45:22.2705092Z 2025-05-07T19:45:22.2705328Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:22.2705608Z 2025-05-07T19:45:22.2705612Z 2025-05-07T19:45:22.2705615Z 2025-05-07T19:45:22.2705636Z 2025-05-07T19:45:22.2705639Z 2025-05-07T19:45:22.2705643Z 2025-05-07T19:45:22.2705647Z 2025-05-07T19:45:22.2705650Z 2025-05-07T19:45:22.2705890Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:22.2706180Z 2025-05-07T19:45:22.2706183Z 2025-05-07T19:45:22.2706187Z 2025-05-07T19:45:22.2706190Z 2025-05-07T19:45:22.2706194Z 2025-05-07T19:45:22.2706214Z 2025-05-07T19:45:22.2706217Z 2025-05-07T19:45:22.2706221Z 2025-05-07T19:45:22.2706224Z 2025-05-07T19:45:22.2706490Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:22.2706792Z 2025-05-07T19:45:22.2707987Z 2025-05-07T19:45:22.2707992Z 2025-05-07T19:45:22.2707997Z 2025-05-07T19:45:22.2708001Z 2025-05-07T19:45:22.2708004Z 2025-05-07T19:45:22.2708027Z 2025-05-07T19:45:22.2708030Z 2025-05-07T19:45:22.2708034Z 2025-05-07T19:45:22.2708076Z 2025-05-07T19:45:22.2708347Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:22.2708632Z 2025-05-07T19:45:22.2708636Z 2025-05-07T19:45:22.2708639Z 2025-05-07T19:45:22.2708643Z 2025-05-07T19:45:22.2708646Z 2025-05-07T19:45:22.2708650Z 2025-05-07T19:45:22.2708653Z 2025-05-07T19:45:22.2708657Z 2025-05-07T19:45:22.2708666Z 2025-05-07T19:45:22.2708669Z 2025-05-07T19:45:22.2708673Z 2025-05-07T19:45:22.2708960Z font-ttf-ubuntu-0.83 | 1.5 MB | | 0%  2025-05-07T19:45:22.2709300Z 2025-05-07T19:45:22.2709304Z 2025-05-07T19:45:22.2709307Z 2025-05-07T19:45:22.2709311Z 2025-05-07T19:45:22.2709314Z 2025-05-07T19:45:22.2709318Z 2025-05-07T19:45:22.2709325Z 2025-05-07T19:45:22.2709329Z 2025-05-07T19:45:22.2709332Z 2025-05-07T19:45:22.2709336Z 2025-05-07T19:45:22.2709339Z 2025-05-07T19:45:22.2709343Z 2025-05-07T19:45:22.2709614Z harfbuzz-9.0.0 | 1.5 MB | | 0%  2025-05-07T19:45:22.2709939Z 2025-05-07T19:45:22.2709942Z 2025-05-07T19:45:22.2709946Z 2025-05-07T19:45:22.2709949Z 2025-05-07T19:45:22.2709953Z 2025-05-07T19:45:22.2709956Z 2025-05-07T19:45:22.2709960Z 2025-05-07T19:45:22.2709964Z 2025-05-07T19:45:22.2709967Z 2025-05-07T19:45:22.2709971Z 2025-05-07T19:45:22.2709974Z 2025-05-07T19:45:22.2710056Z 2025-05-07T19:45:22.2710060Z 2025-05-07T19:45:22.2710365Z libgfortran5-15.1.0 | 1.5 MB | | 0%  2025-05-07T19:45:22.2710725Z 2025-05-07T19:45:22.2710729Z 2025-05-07T19:45:22.2710732Z 2025-05-07T19:45:22.2710736Z 2025-05-07T19:45:22.2710739Z 2025-05-07T19:45:22.2710742Z 2025-05-07T19:45:22.2710746Z 2025-05-07T19:45:22.2710750Z 2025-05-07T19:45:22.2710757Z 2025-05-07T19:45:22.2710761Z 2025-05-07T19:45:22.2710764Z 2025-05-07T19:45:22.2710768Z 2025-05-07T19:45:22.2710771Z 2025-05-07T19:45:22.2710775Z 2025-05-07T19:45:22.2711065Z krb5-1.21.3 | 1.3 MB | | 0%  2025-05-07T19:45:22.2711359Z 2025-05-07T19:45:22.2711363Z 2025-05-07T19:45:22.2711367Z 2025-05-07T19:45:22.2711371Z 2025-05-07T19:45:22.2711374Z 2025-05-07T19:45:22.2711378Z 2025-05-07T19:45:22.2711381Z 2025-05-07T19:45:22.2711385Z 2025-05-07T19:45:22.2711388Z 2025-05-07T19:45:22.2711392Z 2025-05-07T19:45:22.2711399Z 2025-05-07T19:45:22.2711403Z 2025-05-07T19:45:22.2711431Z 2025-05-07T19:45:22.2711434Z 2025-05-07T19:45:22.2711438Z 2025-05-07T19:45:22.2711757Z libabseil-20250127.1 | 1.3 MB | | 0%  2025-05-07T19:45:22.2712106Z 2025-05-07T19:45:22.2712109Z 2025-05-07T19:45:22.2712113Z 2025-05-07T19:45:22.2712117Z 2025-05-07T19:45:22.2712124Z 2025-05-07T19:45:22.2712128Z 2025-05-07T19:45:22.2712131Z 2025-05-07T19:45:22.2712159Z 2025-05-07T19:45:22.2712162Z 2025-05-07T19:45:22.2712166Z 2025-05-07T19:45:22.2712169Z 2025-05-07T19:45:22.2712173Z 2025-05-07T19:45:22.2712176Z 2025-05-07T19:45:22.2712180Z 2025-05-07T19:45:22.2712183Z 2025-05-07T19:45:22.2712207Z 2025-05-07T19:45:22.2712614Z cairo-1.18.0 | 961 KB | | 0%  2025-05-07T19:45:22.2712965Z 2025-05-07T19:45:22.2712969Z 2025-05-07T19:45:22.2712972Z 2025-05-07T19:45:22.2712976Z 2025-05-07T19:45:22.2712983Z 2025-05-07T19:45:22.2712987Z 2025-05-07T19:45:22.2712991Z 2025-05-07T19:45:22.2712994Z 2025-05-07T19:45:22.2712998Z 2025-05-07T19:45:22.2713001Z 2025-05-07T19:45:22.2713005Z 2025-05-07T19:45:22.2713009Z 2025-05-07T19:45:22.2713012Z 2025-05-07T19:45:22.2713016Z 2025-05-07T19:45:22.2713019Z 2025-05-07T19:45:22.2713023Z 2025-05-07T19:45:22.2713027Z 2025-05-07T19:45:22.2713368Z pcre2-10.44 | 934 KB | | 0%  2025-05-07T19:45:22.2713775Z 2025-05-07T19:45:22.2713779Z 2025-05-07T19:45:22.2713783Z 2025-05-07T19:45:22.2713787Z 2025-05-07T19:45:22.2713790Z 2025-05-07T19:45:22.2713794Z 2025-05-07T19:45:22.2713797Z 2025-05-07T19:45:22.2713801Z 2025-05-07T19:45:22.2713804Z 2025-05-07T19:45:22.2713808Z 2025-05-07T19:45:22.2713811Z 2025-05-07T19:45:22.2713815Z 2025-05-07T19:45:22.2713818Z 2025-05-07T19:45:22.2713848Z 2025-05-07T19:45:22.2713851Z 2025-05-07T19:45:22.2713855Z 2025-05-07T19:45:22.2713858Z 2025-05-07T19:45:22.2713865Z 2025-05-07T19:45:22.2714185Z libsqlite-3.49.2 | 895 KB | | 0%  2025-05-07T19:45:22.2714525Z 2025-05-07T19:45:22.2714528Z 2025-05-07T19:45:22.2714532Z 2025-05-07T19:45:22.2714536Z 2025-05-07T19:45:22.2714539Z 2025-05-07T19:45:22.2714570Z 2025-05-07T19:45:22.2714573Z 2025-05-07T19:45:22.2714577Z 2025-05-07T19:45:22.2714584Z 2025-05-07T19:45:22.2714587Z 2025-05-07T19:45:22.2714591Z 2025-05-07T19:45:22.2714594Z 2025-05-07T19:45:22.2714598Z 2025-05-07T19:45:22.2714601Z 2025-05-07T19:45:22.2714604Z 2025-05-07T19:45:22.2714608Z 2025-05-07T19:45:22.2714611Z 2025-05-07T19:45:22.2714615Z 2025-05-07T19:45:22.2714619Z 2025-05-07T19:45:22.4638417Z ... (more hidden) ... 2025-05-07T19:45:22.4639399Z 2025-05-07T19:45:22.4639414Z 2025-05-07T19:45:22.4639425Z 2025-05-07T19:45:22.4639436Z 2025-05-07T19:45:22.5712852Z libgrpc-1.71.0 | 7.6 MB | | 0%  2025-05-07T19:45:22.5713203Z 2025-05-07T19:45:22.5713207Z 2025-05-07T19:45:22.5713210Z 2025-05-07T19:45:22.5713214Z 2025-05-07T19:45:22.5990907Z libgrpc-1.71.0 | 7.6 MB | | 1%  2025-05-07T19:45:22.5991792Z 2025-05-07T19:45:22.5991807Z 2025-05-07T19:45:22.6034921Z python-3.10.17 | 23.9 MB | | 0%  2025-05-07T19:45:22.6035840Z 2025-05-07T19:45:22.6035871Z 2025-05-07T19:45:22.6035883Z 2025-05-07T19:45:22.6224235Z cmake-4.0.2 | 19.4 MB | | 0%  2025-05-07T19:45:22.6472465Z openjdk-23.0.1 | 181.3 MB | | 0% 2025-05-07T19:45:22.6472883Z 2025-05-07T19:45:22.6713064Z bazel-7.5.0 | 47.4 MB | | 0%  2025-05-07T19:45:22.6713395Z 2025-05-07T19:45:22.6713399Z 2025-05-07T19:45:22.6713405Z 2025-05-07T19:45:22.6713409Z 2025-05-07T19:45:22.6992159Z libgrpc-1.71.0 | 7.6 MB | #####2 | 53%  2025-05-07T19:45:22.6993220Z 2025-05-07T19:45:22.6993270Z 2025-05-07T19:45:22.7035285Z python-3.10.17 | 23.9 MB | ##1 | 21%  2025-05-07T19:45:22.7035586Z 2025-05-07T19:45:22.7035590Z 2025-05-07T19:45:22.7036868Z 2025-05-07T19:45:22.7227917Z cmake-4.0.2 | 19.4 MB | ####2 | 42%  2025-05-07T19:45:22.7472682Z openjdk-23.0.1 | 181.3 MB | 3 | 3% 2025-05-07T19:45:22.7473497Z 2025-05-07T19:45:22.7994135Z bazel-7.5.0 | 47.4 MB | 6 | 7%  2025-05-07T19:45:22.7994450Z 2025-05-07T19:45:22.7994455Z 2025-05-07T19:45:22.8035006Z python-3.10.17 | 23.9 MB | ####5 | 45%  2025-05-07T19:45:22.8035329Z 2025-05-07T19:45:22.8035334Z 2025-05-07T19:45:22.8035337Z 2025-05-07T19:45:22.8227164Z cmake-4.0.2 | 19.4 MB | #######2 | 72%  2025-05-07T19:45:22.8474404Z openjdk-23.0.1 | 181.3 MB | 6 | 7% 2025-05-07T19:45:22.8474700Z 2025-05-07T19:45:22.8902889Z bazel-7.5.0 | 47.4 MB | #8 | 19%  2025-05-07T19:45:22.8903249Z 2025-05-07T19:45:22.8903253Z 2025-05-07T19:45:22.8903258Z 2025-05-07T19:45:22.8903261Z 2025-05-07T19:45:22.8903510Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:22.8903790Z 2025-05-07T19:45:22.8903794Z 2025-05-07T19:45:22.8903798Z 2025-05-07T19:45:22.8904990Z 2025-05-07T19:45:22.8994262Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:22.8994814Z 2025-05-07T19:45:22.8994819Z 2025-05-07T19:45:22.9228842Z python-3.10.17 | 23.9 MB | #######2 | 72%  2025-05-07T19:45:22.9462820Z openjdk-23.0.1 | 181.3 MB | # | 10% 2025-05-07T19:45:22.9463173Z 2025-05-07T19:45:22.9463338Z 2025-05-07T19:45:22.9463347Z 2025-05-07T19:45:22.9463353Z 2025-05-07T19:45:22.9463357Z 2025-05-07T19:45:22.9473881Z openblas-0.3.29 | 5.8 MB | | 0%  2025-05-07T19:45:22.9474203Z 2025-05-07T19:45:22.9994673Z bazel-7.5.0 | 47.4 MB | ###1 | 31%  2025-05-07T19:45:22.9994977Z 2025-05-07T19:45:22.9994982Z 2025-05-07T19:45:23.0231160Z python-3.10.17 | 23.9 MB | #########7 | 97%  2025-05-07T19:45:23.0467409Z openjdk-23.0.1 | 181.3 MB | #3 | 14% 2025-05-07T19:45:23.0467912Z 2025-05-07T19:45:23.0467920Z 2025-05-07T19:45:23.0467925Z 2025-05-07T19:45:23.0467935Z 2025-05-07T19:45:23.0467939Z 2025-05-07T19:45:23.0475368Z openblas-0.3.29 | 5.8 MB | ######7 | 68%  2025-05-07T19:45:23.0475797Z 2025-05-07T19:45:23.1230986Z bazel-7.5.0 | 47.4 MB | ####2 | 43%  2025-05-07T19:45:23.1235637Z openjdk-23.0.1 | 181.3 MB | #8 | 19% 2025-05-07T19:45:23.1235896Z 2025-05-07T19:45:23.1235901Z 2025-05-07T19:45:23.1235928Z 2025-05-07T19:45:23.1235931Z 2025-05-07T19:45:23.1235940Z 2025-05-07T19:45:23.1373641Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:23.1374024Z 2025-05-07T19:45:23.1374030Z 2025-05-07T19:45:23.1374035Z 2025-05-07T19:45:23.1374576Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:23.1374850Z 2025-05-07T19:45:23.1374853Z 2025-05-07T19:45:23.1374858Z 2025-05-07T19:45:23.1716164Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:23.1716538Z 2025-05-07T19:45:23.1716544Z 2025-05-07T19:45:23.1716549Z 2025-05-07T19:45:23.1716554Z 2025-05-07T19:45:23.1716581Z 2025-05-07T19:45:23.1716585Z 2025-05-07T19:45:23.1735468Z libopenblas-0.3.29 | 5.6 MB | | 0%  2025-05-07T19:45:23.1735815Z 2025-05-07T19:45:23.1735819Z 2025-05-07T19:45:23.1735823Z 2025-05-07T19:45:23.1735827Z 2025-05-07T19:45:23.1735830Z 2025-05-07T19:45:23.1735834Z 2025-05-07T19:45:23.1735843Z 2025-05-07T19:45:23.2172416Z libcups-2.3.3 | 4.3 MB | | 0%  2025-05-07T19:45:23.2172734Z 2025-05-07T19:45:23.2513933Z bazel-7.5.0 | 47.4 MB | #####3 | 53%  2025-05-07T19:45:23.2716300Z openjdk-23.0.1 | 181.3 MB | ##2 | 22% 2025-05-07T19:45:23.2716587Z 2025-05-07T19:45:23.2716618Z 2025-05-07T19:45:23.2716623Z 2025-05-07T19:45:23.2716627Z 2025-05-07T19:45:23.2716630Z 2025-05-07T19:45:23.2716640Z 2025-05-07T19:45:23.3361100Z libopenblas-0.3.29 | 5.6 MB | ######1 | 61%  2025-05-07T19:45:23.3362041Z 2025-05-07T19:45:23.3362053Z 2025-05-07T19:45:23.3362064Z 2025-05-07T19:45:23.3362107Z 2025-05-07T19:45:23.3362147Z 2025-05-07T19:45:23.3362158Z 2025-05-07T19:45:23.3362192Z 2025-05-07T19:45:23.3362918Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:23.3363747Z 2025-05-07T19:45:23.3363758Z 2025-05-07T19:45:23.3363769Z 2025-05-07T19:45:23.3363779Z 2025-05-07T19:45:23.3363789Z 2025-05-07T19:45:23.3363829Z 2025-05-07T19:45:23.3363839Z 2025-05-07T19:45:23.3394906Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:23.3395230Z 2025-05-07T19:45:23.3395235Z 2025-05-07T19:45:23.3513925Z python-3.10.17 | 23.9 MB | ########## | 100%  2025-05-07T19:45:23.3762172Z openjdk-23.0.1 | 181.3 MB | ##6 | 27% 2025-05-07T19:45:23.3762987Z 2025-05-07T19:45:23.3763000Z 2025-05-07T19:45:23.3763011Z 2025-05-07T19:45:23.3763021Z 2025-05-07T19:45:23.3763032Z 2025-05-07T19:45:23.3763043Z 2025-05-07T19:45:23.3763053Z 2025-05-07T19:45:23.3763064Z 2025-05-07T19:45:23.3763834Z 2025-05-07T19:45:23.3768549Z libprotobuf-5.29.3 | 3.2 MB | | 0%  2025-05-07T19:45:23.3768887Z 2025-05-07T19:45:23.3768891Z 2025-05-07T19:45:23.3768919Z 2025-05-07T19:45:23.3768922Z 2025-05-07T19:45:23.3768926Z 2025-05-07T19:45:23.3768930Z 2025-05-07T19:45:23.3768933Z 2025-05-07T19:45:23.3769347Z 2025-05-07T19:45:23.3774541Z libglib-2.84.0 | 3.8 MB | | 0%  2025-05-07T19:45:23.3774868Z 2025-05-07T19:45:23.3774872Z 2025-05-07T19:45:23.3774876Z 2025-05-07T19:45:23.3774879Z 2025-05-07T19:45:23.3774894Z 2025-05-07T19:45:23.3774906Z 2025-05-07T19:45:23.3775178Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:23.3775485Z 2025-05-07T19:45:23.3775510Z 2025-05-07T19:45:23.3775513Z 2025-05-07T19:45:23.3775517Z 2025-05-07T19:45:23.3775520Z 2025-05-07T19:45:23.3775527Z 2025-05-07T19:45:23.4101693Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:23.4102220Z 2025-05-07T19:45:23.4136001Z bazel-7.5.0 | 47.4 MB | ######2 | 62%  2025-05-07T19:45:23.4136315Z 2025-05-07T19:45:23.4136320Z 2025-05-07T19:45:23.4136323Z 2025-05-07T19:45:23.4136327Z 2025-05-07T19:45:23.4136331Z 2025-05-07T19:45:23.4136334Z 2025-05-07T19:45:23.4136338Z 2025-05-07T19:45:23.4136341Z 2025-05-07T19:45:23.4136345Z 2025-05-07T19:45:23.4136348Z 2025-05-07T19:45:23.4741197Z tk-8.6.13 | 3.2 MB | | 0%  2025-05-07T19:45:23.4920307Z openjdk-23.0.1 | 181.3 MB | ### | 30% 2025-05-07T19:45:23.4920615Z 2025-05-07T19:45:23.4920620Z 2025-05-07T19:45:23.4920624Z 2025-05-07T19:45:23.4920628Z 2025-05-07T19:45:23.4920631Z 2025-05-07T19:45:23.4920635Z 2025-05-07T19:45:23.4920638Z 2025-05-07T19:45:23.4920642Z 2025-05-07T19:45:23.4920645Z 2025-05-07T19:45:23.4922039Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:23.4922377Z 2025-05-07T19:45:23.4922381Z 2025-05-07T19:45:23.4922384Z 2025-05-07T19:45:23.4922388Z 2025-05-07T19:45:23.4922391Z 2025-05-07T19:45:23.4922395Z 2025-05-07T19:45:23.4922398Z 2025-05-07T19:45:23.4922406Z 2025-05-07T19:45:23.4922409Z 2025-05-07T19:45:23.5048708Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:23.5049074Z 2025-05-07T19:45:23.5049079Z 2025-05-07T19:45:23.5049082Z 2025-05-07T19:45:23.5049086Z 2025-05-07T19:45:23.5049089Z 2025-05-07T19:45:23.5049093Z 2025-05-07T19:45:23.5049096Z 2025-05-07T19:45:23.5049100Z 2025-05-07T19:45:23.5049405Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:23.5049700Z 2025-05-07T19:45:23.5049704Z 2025-05-07T19:45:23.5049707Z 2025-05-07T19:45:23.5049711Z 2025-05-07T19:45:23.5049714Z 2025-05-07T19:45:23.5049718Z 2025-05-07T19:45:23.5049721Z 2025-05-07T19:45:23.5049911Z 2025-05-07T19:45:23.5193186Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:23.5193600Z 2025-05-07T19:45:23.5193604Z 2025-05-07T19:45:23.5193608Z 2025-05-07T19:45:23.5193611Z 2025-05-07T19:45:23.5193615Z 2025-05-07T19:45:23.5193618Z 2025-05-07T19:45:23.5193622Z 2025-05-07T19:45:23.5193625Z 2025-05-07T19:45:23.5193629Z 2025-05-07T19:45:23.5193632Z 2025-05-07T19:45:23.5193881Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:23.5194242Z 2025-05-07T19:45:23.5194246Z 2025-05-07T19:45:23.5194250Z 2025-05-07T19:45:23.5194253Z 2025-05-07T19:45:23.5194257Z 2025-05-07T19:45:23.5194268Z 2025-05-07T19:45:23.5194271Z 2025-05-07T19:45:23.5194275Z 2025-05-07T19:45:23.5194278Z 2025-05-07T19:45:23.5194282Z 2025-05-07T19:45:23.5240476Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:23.5240818Z 2025-05-07T19:45:23.5292832Z bazel-7.5.0 | 47.4 MB | ######9 | 70%  2025-05-07T19:45:23.5293123Z 2025-05-07T19:45:23.5293303Z 2025-05-07T19:45:23.5293497Z 2025-05-07T19:45:23.5293502Z 2025-05-07T19:45:23.5293506Z 2025-05-07T19:45:23.5293511Z 2025-05-07T19:45:23.5293516Z 2025-05-07T19:45:23.5293521Z 2025-05-07T19:45:23.5293525Z 2025-05-07T19:45:23.5293530Z 2025-05-07T19:45:23.5293540Z 2025-05-07T19:45:23.5538407Z font-ttf-ubuntu-0.83 | 1.5 MB | 1 | 1%  2025-05-07T19:45:23.5538788Z 2025-05-07T19:45:23.5538792Z 2025-05-07T19:45:23.5538796Z 2025-05-07T19:45:23.5538799Z 2025-05-07T19:45:23.5538804Z 2025-05-07T19:45:23.5538808Z 2025-05-07T19:45:23.5538812Z 2025-05-07T19:45:23.5538886Z 2025-05-07T19:45:23.5538889Z 2025-05-07T19:45:23.5538893Z 2025-05-07T19:45:23.5538896Z 2025-05-07T19:45:23.5538899Z 2025-05-07T19:45:23.5720337Z harfbuzz-9.0.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:23.5720685Z 2025-05-07T19:45:23.5720690Z 2025-05-07T19:45:23.5720694Z 2025-05-07T19:45:23.5720698Z 2025-05-07T19:45:23.5720701Z 2025-05-07T19:45:23.5720725Z 2025-05-07T19:45:23.5720728Z 2025-05-07T19:45:23.5720732Z 2025-05-07T19:45:23.5720735Z 2025-05-07T19:45:23.5720739Z 2025-05-07T19:45:23.5720742Z 2025-05-07T19:45:23.5720746Z 2025-05-07T19:45:23.5720750Z 2025-05-07T19:45:23.5742248Z libgfortran5-15.1.0 | 1.5 MB | 1 | 1%  2025-05-07T19:45:23.5784878Z openjdk-23.0.1 | 181.3 MB | ###4 | 35% 2025-05-07T19:45:23.5785202Z 2025-05-07T19:45:23.5785207Z 2025-05-07T19:45:23.5785211Z 2025-05-07T19:45:23.5785214Z 2025-05-07T19:45:23.5785218Z 2025-05-07T19:45:23.5785428Z 2025-05-07T19:45:23.5785433Z 2025-05-07T19:45:23.5785437Z 2025-05-07T19:45:23.5785440Z 2025-05-07T19:45:23.5785444Z 2025-05-07T19:45:23.5785447Z 2025-05-07T19:45:23.6189676Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:23.6190044Z 2025-05-07T19:45:23.6190048Z 2025-05-07T19:45:23.6190052Z 2025-05-07T19:45:23.6190055Z 2025-05-07T19:45:23.6190076Z 2025-05-07T19:45:23.6190080Z 2025-05-07T19:45:23.6190084Z 2025-05-07T19:45:23.6190087Z 2025-05-07T19:45:23.6190091Z 2025-05-07T19:45:23.6190094Z 2025-05-07T19:45:23.6190098Z 2025-05-07T19:45:23.6190102Z 2025-05-07T19:45:23.6190127Z 2025-05-07T19:45:23.6190130Z 2025-05-07T19:45:23.6241252Z krb5-1.21.3 | 1.3 MB | 1 | 1%  2025-05-07T19:45:23.6241582Z 2025-05-07T19:45:23.6339478Z bazel-7.5.0 | 47.4 MB | ######## | 80%  2025-05-07T19:45:23.6339763Z 2025-05-07T19:45:23.6339767Z 2025-05-07T19:45:23.6339771Z 2025-05-07T19:45:23.6339793Z 2025-05-07T19:45:23.6339797Z 2025-05-07T19:45:23.6339801Z 2025-05-07T19:45:23.6339804Z 2025-05-07T19:45:23.6339808Z 2025-05-07T19:45:23.6339811Z 2025-05-07T19:45:23.6339815Z 2025-05-07T19:45:23.6339818Z 2025-05-07T19:45:23.6339822Z 2025-05-07T19:45:23.6339826Z 2025-05-07T19:45:23.6635743Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:23.6636119Z 2025-05-07T19:45:23.6636123Z 2025-05-07T19:45:23.6636127Z 2025-05-07T19:45:23.6636131Z 2025-05-07T19:45:23.6636134Z 2025-05-07T19:45:23.6636138Z 2025-05-07T19:45:23.6636142Z 2025-05-07T19:45:23.6636146Z 2025-05-07T19:45:23.6636175Z 2025-05-07T19:45:23.6636179Z 2025-05-07T19:45:23.6636182Z 2025-05-07T19:45:23.6636186Z 2025-05-07T19:45:23.6636189Z 2025-05-07T19:45:23.6636198Z 2025-05-07T19:45:23.6694496Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:23.6694860Z 2025-05-07T19:45:23.6694972Z 2025-05-07T19:45:23.6694997Z 2025-05-07T19:45:23.6695002Z 2025-05-07T19:45:23.6695007Z 2025-05-07T19:45:23.6695011Z 2025-05-07T19:45:23.6695016Z 2025-05-07T19:45:23.6695022Z 2025-05-07T19:45:23.6695066Z 2025-05-07T19:45:23.6695071Z 2025-05-07T19:45:23.6695076Z 2025-05-07T19:45:23.6696644Z 2025-05-07T19:45:23.6697167Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:23.6698321Z 2025-05-07T19:45:23.6698325Z 2025-05-07T19:45:23.6698329Z 2025-05-07T19:45:23.6698332Z 2025-05-07T19:45:23.6698336Z 2025-05-07T19:45:23.6698350Z 2025-05-07T19:45:23.6698354Z 2025-05-07T19:45:23.6698357Z 2025-05-07T19:45:23.6698361Z 2025-05-07T19:45:23.6698364Z 2025-05-07T19:45:23.6698389Z 2025-05-07T19:45:23.6698392Z 2025-05-07T19:45:23.6850650Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:23.6851166Z openjdk-23.0.1 | 181.3 MB | ###8 | 38% 2025-05-07T19:45:23.6851426Z 2025-05-07T19:45:23.6851448Z 2025-05-07T19:45:23.6851453Z 2025-05-07T19:45:23.6851456Z 2025-05-07T19:45:23.6851460Z 2025-05-07T19:45:23.6851463Z 2025-05-07T19:45:23.6851467Z 2025-05-07T19:45:23.6851493Z 2025-05-07T19:45:23.6851496Z 2025-05-07T19:45:23.6851500Z 2025-05-07T19:45:23.6851504Z 2025-05-07T19:45:23.6851507Z 2025-05-07T19:45:23.6851511Z 2025-05-07T19:45:23.6851514Z 2025-05-07T19:45:23.6851518Z 2025-05-07T19:45:23.7075698Z libabseil-20250127.1 | 1.3 MB | 1 | 1%  2025-05-07T19:45:23.7076083Z 2025-05-07T19:45:23.7076087Z 2025-05-07T19:45:23.7076091Z 2025-05-07T19:45:23.7076094Z 2025-05-07T19:45:23.7076098Z 2025-05-07T19:45:23.7076101Z 2025-05-07T19:45:23.7076104Z 2025-05-07T19:45:23.7076108Z 2025-05-07T19:45:23.7076111Z 2025-05-07T19:45:23.7076116Z 2025-05-07T19:45:23.7076142Z 2025-05-07T19:45:23.7076146Z 2025-05-07T19:45:23.7076149Z 2025-05-07T19:45:23.7076153Z 2025-05-07T19:45:23.7076156Z 2025-05-07T19:45:23.7076160Z 2025-05-07T19:45:23.7076366Z 2025-05-07T19:45:23.7133617Z pcre2-10.44 | 934 KB | 1 | 2%  2025-05-07T19:45:23.7134513Z 2025-05-07T19:45:23.7134518Z 2025-05-07T19:45:23.7134521Z 2025-05-07T19:45:23.7134525Z 2025-05-07T19:45:23.7134528Z 2025-05-07T19:45:23.7134532Z 2025-05-07T19:45:23.7134535Z 2025-05-07T19:45:23.7134539Z 2025-05-07T19:45:23.7134559Z 2025-05-07T19:45:23.7134563Z 2025-05-07T19:45:23.7134566Z 2025-05-07T19:45:23.7134570Z 2025-05-07T19:45:23.7134573Z 2025-05-07T19:45:23.7134577Z 2025-05-07T19:45:23.7134580Z 2025-05-07T19:45:23.7134584Z 2025-05-07T19:45:23.7180809Z cairo-1.18.0 | 961 KB | 1 | 2%  2025-05-07T19:45:23.7181812Z 2025-05-07T19:45:23.7181826Z 2025-05-07T19:45:23.7181837Z 2025-05-07T19:45:23.7181847Z 2025-05-07T19:45:23.7249805Z libgrpc-1.71.0 | 7.6 MB | ########## | 100%  2025-05-07T19:45:23.7250482Z 2025-05-07T19:45:23.7522770Z bazel-7.5.0 | 47.4 MB | #########1 | 92%  2025-05-07T19:45:23.7523069Z 2025-05-07T19:45:23.7523073Z 2025-05-07T19:45:23.7523076Z 2025-05-07T19:45:23.7523080Z 2025-05-07T19:45:23.7523083Z 2025-05-07T19:45:23.7523113Z 2025-05-07T19:45:23.7523116Z 2025-05-07T19:45:23.7523120Z 2025-05-07T19:45:23.7523123Z 2025-05-07T19:45:23.7523127Z 2025-05-07T19:45:23.7523130Z 2025-05-07T19:45:23.7523142Z 2025-05-07T19:45:23.7523145Z 2025-05-07T19:45:23.7523149Z 2025-05-07T19:45:23.7523152Z 2025-05-07T19:45:23.7583873Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:23.7584281Z 2025-05-07T19:45:23.7584285Z 2025-05-07T19:45:23.7584289Z 2025-05-07T19:45:23.7584365Z 2025-05-07T19:45:23.7584375Z 2025-05-07T19:45:23.7584381Z 2025-05-07T19:45:23.7584386Z 2025-05-07T19:45:23.7584390Z 2025-05-07T19:45:23.7584395Z 2025-05-07T19:45:23.7584399Z 2025-05-07T19:45:23.7584404Z 2025-05-07T19:45:23.7584408Z 2025-05-07T19:45:23.7584430Z 2025-05-07T19:45:23.7584435Z 2025-05-07T19:45:23.7584439Z 2025-05-07T19:45:23.7584444Z 2025-05-07T19:45:23.7623289Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:23.7623638Z 2025-05-07T19:45:23.7623643Z 2025-05-07T19:45:23.7623646Z 2025-05-07T19:45:23.7623650Z 2025-05-07T19:45:23.7623653Z 2025-05-07T19:45:23.7623880Z 2025-05-07T19:45:23.7623884Z 2025-05-07T19:45:23.7623887Z 2025-05-07T19:45:23.7623899Z 2025-05-07T19:45:23.7623902Z 2025-05-07T19:45:23.7623932Z 2025-05-07T19:45:23.7623935Z 2025-05-07T19:45:23.7623939Z 2025-05-07T19:45:23.7623943Z 2025-05-07T19:45:23.7623946Z 2025-05-07T19:45:23.7623950Z 2025-05-07T19:45:23.7623953Z 2025-05-07T19:45:23.7975933Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:23.7976358Z 2025-05-07T19:45:23.7976363Z 2025-05-07T19:45:23.7976369Z 2025-05-07T19:45:23.7976373Z 2025-05-07T19:45:23.7976394Z 2025-05-07T19:45:23.7976399Z 2025-05-07T19:45:23.7976403Z 2025-05-07T19:45:23.7976408Z 2025-05-07T19:45:23.7976412Z 2025-05-07T19:45:23.7976417Z 2025-05-07T19:45:23.7976421Z 2025-05-07T19:45:23.7976426Z 2025-05-07T19:45:23.7976430Z 2025-05-07T19:45:23.7976435Z 2025-05-07T19:45:23.7976440Z 2025-05-07T19:45:23.7976444Z 2025-05-07T19:45:23.7976449Z 2025-05-07T19:45:23.7976453Z 2025-05-07T19:45:23.7991226Z libsqlite-3.49.2 | 895 KB | 1 | 2%  2025-05-07T19:45:23.8012982Z openjdk-23.0.1 | 181.3 MB | ####2 | 42% 2025-05-07T19:45:23.8013410Z 2025-05-07T19:45:23.8013415Z 2025-05-07T19:45:23.8013420Z 2025-05-07T19:45:23.8013424Z 2025-05-07T19:45:23.8013429Z 2025-05-07T19:45:23.8013433Z 2025-05-07T19:45:23.8013439Z 2025-05-07T19:45:23.8013443Z 2025-05-07T19:45:23.8013448Z 2025-05-07T19:45:23.8013452Z 2025-05-07T19:45:23.8013457Z 2025-05-07T19:45:23.8013536Z 2025-05-07T19:45:23.8013539Z 2025-05-07T19:45:23.8013805Z 2025-05-07T19:45:23.8013813Z 2025-05-07T19:45:23.8013824Z 2025-05-07T19:45:23.8013830Z 2025-05-07T19:45:23.8013836Z 2025-05-07T19:45:23.8013841Z 2025-05-07T19:45:23.8298553Z ... (more hidden) ... 2025-05-07T19:45:23.8298895Z 2025-05-07T19:45:23.8298899Z 2025-05-07T19:45:23.8298903Z 2025-05-07T19:45:23.8298930Z 2025-05-07T19:45:23.8298948Z 2025-05-07T19:45:23.8298952Z 2025-05-07T19:45:23.8298955Z 2025-05-07T19:45:23.8298959Z 2025-05-07T19:45:23.8298962Z 2025-05-07T19:45:23.8298966Z 2025-05-07T19:45:23.8298969Z 2025-05-07T19:45:23.8298973Z 2025-05-07T19:45:23.8298976Z 2025-05-07T19:45:23.8298986Z 2025-05-07T19:45:23.8298989Z 2025-05-07T19:45:23.8298993Z 2025-05-07T19:45:23.8298996Z 2025-05-07T19:45:23.8299000Z 2025-05-07T19:45:23.8299003Z 2025-05-07T19:45:23.8314327Z ... (more hidden) ... 2025-05-07T19:45:23.8314661Z 2025-05-07T19:45:23.8314665Z 2025-05-07T19:45:23.8314683Z 2025-05-07T19:45:23.8314686Z 2025-05-07T19:45:23.8314690Z 2025-05-07T19:45:23.8314717Z 2025-05-07T19:45:23.8314721Z 2025-05-07T19:45:23.8314725Z 2025-05-07T19:45:23.8314728Z 2025-05-07T19:45:23.8314732Z 2025-05-07T19:45:23.8314736Z 2025-05-07T19:45:23.8314739Z 2025-05-07T19:45:23.8314743Z 2025-05-07T19:45:23.8314747Z 2025-05-07T19:45:23.8314750Z 2025-05-07T19:45:23.8314754Z 2025-05-07T19:45:23.8314766Z 2025-05-07T19:45:23.8314770Z 2025-05-07T19:45:23.8992370Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:23.9059471Z openjdk-23.0.1 | 181.3 MB | ####6 | 46% 2025-05-07T19:45:23.9059798Z 2025-05-07T19:45:23.9059872Z 2025-05-07T19:45:23.9059996Z 2025-05-07T19:45:23.9060005Z 2025-05-07T19:45:23.9060009Z 2025-05-07T19:45:23.9663374Z openblas-0.3.29 | 5.8 MB | ########## | 100%  2025-05-07T19:45:24.0178747Z 2025-05-07T19:45:24.0178754Z 2025-05-07T19:45:24.0178780Z 2025-05-07T19:45:24.0178785Z 2025-05-07T19:45:24.0178790Z 2025-05-07T19:45:24.0178795Z 2025-05-07T19:45:24.0178800Z 2025-05-07T19:45:24.0179254Z libcups-2.3.3 | 4.3 MB | ########## | 100%  2025-05-07T19:45:24.1182058Z openjdk-23.0.1 | 181.3 MB | ##### | 50% 2025-05-07T19:45:24.2183024Z openjdk-23.0.1 | 181.3 MB | #####4 | 55% 2025-05-07T19:45:24.3197546Z openjdk-23.0.1 | 181.3 MB | #####9 | 59% 2025-05-07T19:45:24.3815586Z openjdk-23.0.1 | 181.3 MB | ######3 | 64% 2025-05-07T19:45:24.3815941Z 2025-05-07T19:45:24.3816066Z 2025-05-07T19:45:24.3816073Z 2025-05-07T19:45:24.3816081Z 2025-05-07T19:45:24.3816087Z 2025-05-07T19:45:24.3816091Z 2025-05-07T19:45:24.4198916Z libopenblas-0.3.29 | 5.6 MB | ########## | 100%  2025-05-07T19:45:24.5714556Z openjdk-23.0.1 | 181.3 MB | ######7 | 68% 2025-05-07T19:45:24.5714852Z 2025-05-07T19:45:24.5799291Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:24.6801552Z openjdk-23.0.1 | 181.3 MB | #######1 | 72% 2025-05-07T19:45:24.7805869Z openjdk-23.0.1 | 181.3 MB | #######6 | 77% 2025-05-07T19:45:24.8913816Z openjdk-23.0.1 | 181.3 MB | ########1 | 82% 2025-05-07T19:45:24.8914223Z 2025-05-07T19:45:24.8914228Z 2025-05-07T19:45:24.8914234Z 2025-05-07T19:45:24.8914239Z 2025-05-07T19:45:24.8914268Z 2025-05-07T19:45:24.8914272Z 2025-05-07T19:45:24.8914277Z 2025-05-07T19:45:24.8914281Z 2025-05-07T19:45:24.8914286Z 2025-05-07T19:45:24.9317165Z libprotobuf-5.29.3 | 3.2 MB | ########## | 100%  2025-05-07T19:45:25.0321527Z openjdk-23.0.1 | 181.3 MB | ########5 | 86% 2025-05-07T19:45:25.1323003Z openjdk-23.0.1 | 181.3 MB | #########1 | 91% 2025-05-07T19:45:25.1790467Z openjdk-23.0.1 | 181.3 MB | #########7 | 97% 2025-05-07T19:45:25.1791320Z 2025-05-07T19:45:25.1791334Z 2025-05-07T19:45:25.1791345Z 2025-05-07T19:45:25.1791356Z 2025-05-07T19:45:25.1791805Z 2025-05-07T19:45:25.1791818Z 2025-05-07T19:45:25.1791829Z 2025-05-07T19:45:25.1791839Z 2025-05-07T19:45:25.5312272Z libglib-2.84.0 | 3.8 MB | ########## | 100%  2025-05-07T19:45:25.5312768Z 2025-05-07T19:45:25.5312773Z 2025-05-07T19:45:25.5312777Z 2025-05-07T19:45:25.5312782Z 2025-05-07T19:45:25.5312785Z 2025-05-07T19:45:25.5312790Z 2025-05-07T19:45:25.5312815Z 2025-05-07T19:45:25.5312819Z 2025-05-07T19:45:25.5312822Z 2025-05-07T19:45:25.5312826Z 2025-05-07T19:45:25.5756270Z tk-8.6.13 | 3.2 MB | ########## | 100%  2025-05-07T19:45:25.5756601Z 2025-05-07T19:45:25.5756606Z 2025-05-07T19:45:25.5756609Z 2025-05-07T19:45:25.5756613Z 2025-05-07T19:45:25.5756617Z 2025-05-07T19:45:25.5756620Z 2025-05-07T19:45:25.5756624Z 2025-05-07T19:45:25.5756627Z 2025-05-07T19:45:25.5756631Z 2025-05-07T19:45:25.5756634Z 2025-05-07T19:45:25.5756638Z 2025-05-07T19:45:25.5763325Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:25.5763710Z 2025-05-07T19:45:25.5763715Z 2025-05-07T19:45:25.5763718Z 2025-05-07T19:45:25.5763722Z 2025-05-07T19:45:25.5763726Z 2025-05-07T19:45:25.5763729Z 2025-05-07T19:45:25.5763733Z 2025-05-07T19:45:25.5763736Z 2025-05-07T19:45:25.5763740Z 2025-05-07T19:45:25.5763743Z 2025-05-07T19:45:25.5763780Z 2025-05-07T19:45:25.6526793Z font-ttf-ubuntu-0.83 | 1.5 MB | ########## | 100%  2025-05-07T19:45:25.6527233Z 2025-05-07T19:45:25.6527238Z 2025-05-07T19:45:25.6527242Z 2025-05-07T19:45:25.6527245Z 2025-05-07T19:45:25.6527248Z 2025-05-07T19:45:25.6527252Z 2025-05-07T19:45:25.6527255Z 2025-05-07T19:45:25.6527259Z 2025-05-07T19:45:25.6527262Z 2025-05-07T19:45:25.6527266Z 2025-05-07T19:45:25.6527294Z 2025-05-07T19:45:25.6527297Z 2025-05-07T19:45:25.6527301Z 2025-05-07T19:45:25.6527623Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:25.6527967Z 2025-05-07T19:45:25.6527971Z 2025-05-07T19:45:25.6527974Z 2025-05-07T19:45:25.6527978Z 2025-05-07T19:45:25.6527981Z 2025-05-07T19:45:25.6527985Z 2025-05-07T19:45:25.6528016Z 2025-05-07T19:45:25.6528019Z 2025-05-07T19:45:25.6528023Z 2025-05-07T19:45:25.6528026Z 2025-05-07T19:45:25.6528030Z 2025-05-07T19:45:25.6528033Z 2025-05-07T19:45:25.6528036Z 2025-05-07T19:45:25.7755404Z libgfortran5-15.1.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:25.7756086Z 2025-05-07T19:45:25.7756090Z 2025-05-07T19:45:25.7756094Z 2025-05-07T19:45:25.7756098Z 2025-05-07T19:45:25.7756102Z 2025-05-07T19:45:25.7756106Z 2025-05-07T19:45:25.7756109Z 2025-05-07T19:45:25.7756113Z 2025-05-07T19:45:25.7756117Z 2025-05-07T19:45:25.7756120Z 2025-05-07T19:45:25.7756124Z 2025-05-07T19:45:25.7756127Z 2025-05-07T19:45:25.7756130Z 2025-05-07T19:45:25.7756134Z 2025-05-07T19:45:25.7756481Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:25.7756784Z 2025-05-07T19:45:25.7756788Z 2025-05-07T19:45:25.7756791Z 2025-05-07T19:45:25.7756794Z 2025-05-07T19:45:25.7756798Z 2025-05-07T19:45:25.7756801Z 2025-05-07T19:45:25.7756805Z 2025-05-07T19:45:25.7756809Z 2025-05-07T19:45:25.7756812Z 2025-05-07T19:45:25.7756816Z 2025-05-07T19:45:25.7756819Z 2025-05-07T19:45:25.7756823Z 2025-05-07T19:45:25.7756858Z 2025-05-07T19:45:25.7756866Z 2025-05-07T19:45:25.8812417Z krb5-1.21.3 | 1.3 MB | ########## | 100%  2025-05-07T19:45:25.8812766Z 2025-05-07T19:45:25.8812771Z 2025-05-07T19:45:25.8812775Z 2025-05-07T19:45:25.8812779Z 2025-05-07T19:45:25.8812783Z 2025-05-07T19:45:25.8812813Z 2025-05-07T19:45:25.8812816Z 2025-05-07T19:45:25.8812820Z 2025-05-07T19:45:25.8812824Z 2025-05-07T19:45:25.8812827Z 2025-05-07T19:45:25.8812831Z 2025-05-07T19:45:25.8812834Z 2025-05-07T19:45:26.0088680Z harfbuzz-9.0.0 | 1.5 MB | ########## | 100%  2025-05-07T19:45:26.0089687Z 2025-05-07T19:45:26.0089700Z 2025-05-07T19:45:26.0872861Z python-3.10.17 | 23.9 MB | ########## | 100%  2025-05-07T19:45:26.0873191Z 2025-05-07T19:45:26.0873196Z 2025-05-07T19:45:26.0873200Z 2025-05-07T19:45:26.0873203Z 2025-05-07T19:45:26.0873207Z 2025-05-07T19:45:26.0873210Z 2025-05-07T19:45:26.0873214Z 2025-05-07T19:45:26.0873217Z 2025-05-07T19:45:26.0873242Z 2025-05-07T19:45:26.0873246Z 2025-05-07T19:45:26.0873273Z 2025-05-07T19:45:26.0873277Z 2025-05-07T19:45:26.0873280Z 2025-05-07T19:45:26.0873284Z 2025-05-07T19:45:26.0873288Z 2025-05-07T19:45:26.0873291Z 2025-05-07T19:45:26.0873614Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:26.0873936Z 2025-05-07T19:45:26.0873940Z 2025-05-07T19:45:26.0873943Z 2025-05-07T19:45:26.0873947Z 2025-05-07T19:45:26.0873974Z 2025-05-07T19:45:26.0873977Z 2025-05-07T19:45:26.0873981Z 2025-05-07T19:45:26.0873984Z 2025-05-07T19:45:26.0873997Z 2025-05-07T19:45:26.0874013Z 2025-05-07T19:45:26.0874017Z 2025-05-07T19:45:26.0874021Z 2025-05-07T19:45:26.0874024Z 2025-05-07T19:45:26.0874028Z 2025-05-07T19:45:26.0874031Z 2025-05-07T19:45:26.0874034Z 2025-05-07T19:45:26.2796725Z cairo-1.18.0 | 961 KB | ########## | 100%  2025-05-07T19:45:26.2797134Z 2025-05-07T19:45:26.2797159Z 2025-05-07T19:45:26.2797163Z 2025-05-07T19:45:26.2797167Z 2025-05-07T19:45:26.2797170Z 2025-05-07T19:45:26.2797174Z 2025-05-07T19:45:26.2797178Z 2025-05-07T19:45:26.2797181Z 2025-05-07T19:45:26.2797185Z 2025-05-07T19:45:26.2797189Z 2025-05-07T19:45:26.2797193Z 2025-05-07T19:45:26.2797196Z 2025-05-07T19:45:26.2797199Z 2025-05-07T19:45:26.2797203Z 2025-05-07T19:45:26.2797206Z 2025-05-07T19:45:26.2797210Z 2025-05-07T19:45:26.2797213Z 2025-05-07T19:45:26.2797554Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:26.2797886Z 2025-05-07T19:45:26.2797890Z 2025-05-07T19:45:26.2797893Z 2025-05-07T19:45:26.2797897Z 2025-05-07T19:45:26.2797900Z 2025-05-07T19:45:26.2797904Z 2025-05-07T19:45:26.2797907Z 2025-05-07T19:45:26.2797911Z 2025-05-07T19:45:26.2797914Z 2025-05-07T19:45:26.2797917Z 2025-05-07T19:45:26.2797921Z 2025-05-07T19:45:26.2797925Z 2025-05-07T19:45:26.2797928Z 2025-05-07T19:45:26.2797932Z 2025-05-07T19:45:26.2798188Z 2025-05-07T19:45:26.2798219Z 2025-05-07T19:45:26.2798223Z 2025-05-07T19:45:26.3691107Z pcre2-10.44 | 934 KB | ########## | 100%  2025-05-07T19:45:26.3691651Z 2025-05-07T19:45:26.3691661Z 2025-05-07T19:45:26.3691683Z 2025-05-07T19:45:26.3691687Z 2025-05-07T19:45:26.3691691Z 2025-05-07T19:45:26.3691695Z 2025-05-07T19:45:26.3691698Z 2025-05-07T19:45:26.3691702Z 2025-05-07T19:45:26.3691706Z 2025-05-07T19:45:26.3691709Z 2025-05-07T19:45:26.3691713Z 2025-05-07T19:45:26.3691716Z 2025-05-07T19:45:26.3691725Z 2025-05-07T19:45:26.3691749Z 2025-05-07T19:45:26.3691753Z 2025-05-07T19:45:26.3697228Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:26.3697668Z 2025-05-07T19:45:26.3697672Z 2025-05-07T19:45:26.3697676Z 2025-05-07T19:45:26.3697680Z 2025-05-07T19:45:26.3697683Z 2025-05-07T19:45:26.3697687Z 2025-05-07T19:45:26.3697690Z 2025-05-07T19:45:26.3697714Z 2025-05-07T19:45:26.3697718Z 2025-05-07T19:45:26.3697722Z 2025-05-07T19:45:26.3697726Z 2025-05-07T19:45:26.3697730Z 2025-05-07T19:45:26.3697734Z 2025-05-07T19:45:26.3697738Z 2025-05-07T19:45:26.3697742Z 2025-05-07T19:45:26.4030710Z libabseil-20250127.1 | 1.3 MB | ########## | 100%  2025-05-07T19:45:26.4032250Z 2025-05-07T19:45:26.4032266Z 2025-05-07T19:45:26.4032277Z 2025-05-07T19:45:26.4032287Z 2025-05-07T19:45:26.4032298Z 2025-05-07T19:45:26.4032308Z 2025-05-07T19:45:26.4032319Z 2025-05-07T19:45:26.4032330Z 2025-05-07T19:45:26.4033017Z 2025-05-07T19:45:26.4033037Z 2025-05-07T19:45:26.4033047Z 2025-05-07T19:45:26.4033100Z 2025-05-07T19:45:26.4033110Z 2025-05-07T19:45:26.4033121Z 2025-05-07T19:45:26.4033131Z 2025-05-07T19:45:26.4033141Z 2025-05-07T19:45:26.4033152Z 2025-05-07T19:45:26.4033162Z 2025-05-07T19:45:26.4033937Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:26.4034298Z 2025-05-07T19:45:26.4034302Z 2025-05-07T19:45:26.4034306Z 2025-05-07T19:45:26.4034310Z 2025-05-07T19:45:26.4034314Z 2025-05-07T19:45:26.4034317Z 2025-05-07T19:45:26.4034321Z 2025-05-07T19:45:26.4034324Z 2025-05-07T19:45:26.4034359Z 2025-05-07T19:45:26.4034362Z 2025-05-07T19:45:26.4034366Z 2025-05-07T19:45:26.4034369Z 2025-05-07T19:45:26.4034373Z 2025-05-07T19:45:26.4034376Z 2025-05-07T19:45:26.4034380Z 2025-05-07T19:45:26.4034383Z 2025-05-07T19:45:26.4034386Z 2025-05-07T19:45:26.4034390Z 2025-05-07T19:45:26.5313899Z libsqlite-3.49.2 | 895 KB | ########## | 100%  2025-05-07T19:45:26.5314319Z 2025-05-07T19:45:26.5314324Z 2025-05-07T19:45:26.5314327Z 2025-05-07T19:45:27.0530669Z cmake-4.0.2 | 19.4 MB | ########## | 100%  2025-05-07T19:45:27.6359011Z openjdk-23.0.1 | 181.3 MB | ########## | 100% 2025-05-07T19:45:27.6359899Z 2025-05-07T19:45:27.6359913Z 2025-05-07T19:45:27.6359959Z 2025-05-07T19:45:27.6359970Z 2025-05-07T19:45:27.6359981Z 2025-05-07T19:45:27.6359992Z 2025-05-07T19:45:27.6360002Z 2025-05-07T19:45:27.6360012Z 2025-05-07T19:45:27.6360023Z 2025-05-07T19:45:27.6360033Z 2025-05-07T19:45:27.6360043Z 2025-05-07T19:45:27.6360054Z 2025-05-07T19:45:27.6360065Z 2025-05-07T19:45:27.6360075Z 2025-05-07T19:45:27.6360085Z 2025-05-07T19:45:27.6360096Z 2025-05-07T19:45:27.6360106Z 2025-05-07T19:45:27.6360116Z 2025-05-07T19:45:27.6360159Z 2025-05-07T19:45:27.6361055Z ... (more hidden) ... 2025-05-07T19:45:27.6361951Z 2025-05-07T19:45:27.6361962Z 2025-05-07T19:45:27.6361974Z 2025-05-07T19:45:27.6361984Z 2025-05-07T19:45:27.6361994Z 2025-05-07T19:45:27.6362004Z 2025-05-07T19:45:27.6362015Z 2025-05-07T19:45:27.6362025Z 2025-05-07T19:45:27.6362035Z 2025-05-07T19:45:27.6362073Z 2025-05-07T19:45:27.6362083Z 2025-05-07T19:45:27.6362093Z 2025-05-07T19:45:27.6362103Z 2025-05-07T19:45:27.6362113Z 2025-05-07T19:45:27.6362565Z 2025-05-07T19:45:27.6362576Z 2025-05-07T19:45:27.6362587Z 2025-05-07T19:45:27.6362597Z 2025-05-07T19:45:27.6362607Z 2025-05-07T19:45:28.2661539Z ... (more hidden) ... 2025-05-07T19:45:28.2662468Z 2025-05-07T19:45:28.8030434Z bazel-7.5.0 | 47.4 MB | ########## | 100%  2025-05-07T19:45:28.8035959Z openjdk-23.0.1 | 181.3 MB | ########## | 100% 2025-05-07T19:45:28.8036232Z 2025-05-07T19:45:28.8036326Z 2025-05-07T19:45:28.8036495Z 2025-05-07T19:45:28.8036499Z 2025-05-07T19:45:28.8036503Z 2025-05-07T19:45:28.8036525Z 2025-05-07T19:45:28.8036529Z 2025-05-07T19:45:28.8036533Z 2025-05-07T19:45:28.8036537Z 2025-05-07T19:45:28.8036540Z 2025-05-07T19:45:28.8036544Z 2025-05-07T19:45:28.8036547Z 2025-05-07T19:45:28.8036552Z 2025-05-07T19:45:28.8036555Z 2025-05-07T19:45:28.8036560Z 2025-05-07T19:45:28.8036563Z 2025-05-07T19:45:28.8036567Z 2025-05-07T19:45:28.8036570Z 2025-05-07T19:45:28.8036621Z 2025-05-07T19:45:28.8036786Z 2025-05-07T19:45:28.8037234Z  2025-05-07T19:45:28.8037630Z 2025-05-07T19:45:28.8037884Z 2025-05-07T19:45:28.8038136Z  2025-05-07T19:45:28.8038392Z 2025-05-07T19:45:28.8038396Z 2025-05-07T19:45:28.8038574Z  2025-05-07T19:45:28.8038807Z 2025-05-07T19:45:28.8038836Z 2025-05-07T19:45:28.8038839Z 2025-05-07T19:45:28.8039276Z  2025-05-07T19:45:28.8039512Z 2025-05-07T19:45:28.8039517Z 2025-05-07T19:45:28.8039520Z 2025-05-07T19:45:28.8039524Z 2025-05-07T19:45:28.8039747Z  2025-05-07T19:45:28.8039981Z 2025-05-07T19:45:28.8039984Z 2025-05-07T19:45:28.8039988Z 2025-05-07T19:45:28.8039992Z 2025-05-07T19:45:28.8040001Z 2025-05-07T19:45:28.8040200Z  2025-05-07T19:45:28.8040467Z 2025-05-07T19:45:28.8040470Z 2025-05-07T19:45:28.8040474Z 2025-05-07T19:45:28.8040477Z 2025-05-07T19:45:28.8040481Z 2025-05-07T19:45:28.8040484Z 2025-05-07T19:45:28.8040710Z  2025-05-07T19:45:28.8040978Z 2025-05-07T19:45:28.8040981Z 2025-05-07T19:45:28.8040985Z 2025-05-07T19:45:28.8040988Z 2025-05-07T19:45:28.8040992Z 2025-05-07T19:45:28.8040995Z 2025-05-07T19:45:28.8040999Z 2025-05-07T19:45:28.8041202Z  2025-05-07T19:45:28.8041476Z 2025-05-07T19:45:28.8041480Z 2025-05-07T19:45:28.8041484Z 2025-05-07T19:45:28.8041487Z 2025-05-07T19:45:28.8041491Z 2025-05-07T19:45:28.8041494Z 2025-05-07T19:45:28.8041498Z 2025-05-07T19:45:28.8041501Z 2025-05-07T19:45:28.8041700Z  2025-05-07T19:45:28.8041973Z 2025-05-07T19:45:28.8041977Z 2025-05-07T19:45:28.8041981Z 2025-05-07T19:45:28.8041984Z 2025-05-07T19:45:28.8041988Z 2025-05-07T19:45:28.8041991Z 2025-05-07T19:45:28.8041994Z 2025-05-07T19:45:28.8041998Z 2025-05-07T19:45:28.8042001Z 2025-05-07T19:45:28.8042212Z  2025-05-07T19:45:28.8042490Z 2025-05-07T19:45:28.8042494Z 2025-05-07T19:45:28.8042498Z 2025-05-07T19:45:28.8042501Z 2025-05-07T19:45:28.8042505Z 2025-05-07T19:45:28.8042508Z 2025-05-07T19:45:28.8042515Z 2025-05-07T19:45:28.8042519Z 2025-05-07T19:45:28.8042523Z 2025-05-07T19:45:28.8042526Z 2025-05-07T19:45:28.8042734Z  2025-05-07T19:45:28.8043015Z 2025-05-07T19:45:28.8043019Z 2025-05-07T19:45:28.8043022Z 2025-05-07T19:45:28.8043026Z 2025-05-07T19:45:28.8043029Z 2025-05-07T19:45:28.8043032Z 2025-05-07T19:45:28.8043127Z 2025-05-07T19:45:28.8043131Z 2025-05-07T19:45:28.8043134Z 2025-05-07T19:45:28.8043138Z 2025-05-07T19:45:28.8043141Z 2025-05-07T19:45:28.8043361Z  2025-05-07T19:45:28.8043644Z 2025-05-07T19:45:28.8043648Z 2025-05-07T19:45:28.8043651Z 2025-05-07T19:45:28.8043655Z 2025-05-07T19:45:28.8043658Z 2025-05-07T19:45:28.8043662Z 2025-05-07T19:45:28.8043665Z 2025-05-07T19:45:28.8043669Z 2025-05-07T19:45:28.8043672Z 2025-05-07T19:45:28.8043675Z 2025-05-07T19:45:28.8043679Z 2025-05-07T19:45:28.8043682Z 2025-05-07T19:45:28.8043903Z  2025-05-07T19:45:28.8044190Z 2025-05-07T19:45:28.8044194Z 2025-05-07T19:45:28.8044197Z 2025-05-07T19:45:28.8044201Z 2025-05-07T19:45:28.8044204Z 2025-05-07T19:45:28.8044208Z 2025-05-07T19:45:28.8044211Z 2025-05-07T19:45:28.8044215Z 2025-05-07T19:45:28.8044218Z 2025-05-07T19:45:28.8044222Z 2025-05-07T19:45:28.8044230Z 2025-05-07T19:45:28.8044233Z 2025-05-07T19:45:28.8044237Z 2025-05-07T19:45:28.8044462Z  2025-05-07T19:45:28.8044757Z 2025-05-07T19:45:28.8044760Z 2025-05-07T19:45:28.8044764Z 2025-05-07T19:45:28.8044768Z 2025-05-07T19:45:28.8044771Z 2025-05-07T19:45:28.8044775Z 2025-05-07T19:45:28.8044778Z 2025-05-07T19:45:28.8044781Z 2025-05-07T19:45:28.8044785Z 2025-05-07T19:45:28.8044789Z 2025-05-07T19:45:28.8044792Z 2025-05-07T19:45:28.8044796Z 2025-05-07T19:45:28.8044800Z 2025-05-07T19:45:28.8044866Z 2025-05-07T19:45:28.8045171Z  2025-05-07T19:45:28.8045434Z 2025-05-07T19:45:28.8045438Z 2025-05-07T19:45:28.8045441Z 2025-05-07T19:45:28.8045445Z 2025-05-07T19:45:28.8045448Z 2025-05-07T19:45:28.8045477Z 2025-05-07T19:45:28.8045480Z 2025-05-07T19:45:28.8045484Z 2025-05-07T19:45:28.8045487Z 2025-05-07T19:45:28.8045495Z 2025-05-07T19:45:28.8045498Z 2025-05-07T19:45:28.8045502Z 2025-05-07T19:45:28.8045505Z 2025-05-07T19:45:28.8045509Z 2025-05-07T19:45:28.8045512Z 2025-05-07T19:45:28.8045742Z  2025-05-07T19:45:28.8046031Z 2025-05-07T19:45:28.8046035Z 2025-05-07T19:45:28.8046038Z 2025-05-07T19:45:28.8046042Z 2025-05-07T19:45:28.8046045Z 2025-05-07T19:45:28.8046049Z 2025-05-07T19:45:28.8046052Z 2025-05-07T19:45:28.8046055Z 2025-05-07T19:45:28.8046059Z 2025-05-07T19:45:28.8046062Z 2025-05-07T19:45:28.8046069Z 2025-05-07T19:45:28.8046073Z 2025-05-07T19:45:28.8046077Z 2025-05-07T19:45:28.8046081Z 2025-05-07T19:45:28.8046084Z 2025-05-07T19:45:28.8046087Z 2025-05-07T19:45:28.8046328Z  2025-05-07T19:45:28.8046629Z 2025-05-07T19:45:28.8046632Z 2025-05-07T19:45:28.8046636Z 2025-05-07T19:45:28.8046643Z 2025-05-07T19:45:28.8046646Z 2025-05-07T19:45:28.8046650Z 2025-05-07T19:45:28.8046654Z 2025-05-07T19:45:28.8046657Z 2025-05-07T19:45:28.8046660Z 2025-05-07T19:45:28.8046664Z 2025-05-07T19:45:28.8046667Z 2025-05-07T19:45:28.8046670Z 2025-05-07T19:45:28.8046674Z 2025-05-07T19:45:28.8046677Z 2025-05-07T19:45:28.8046680Z 2025-05-07T19:45:28.8046684Z 2025-05-07T19:45:28.8046687Z 2025-05-07T19:45:28.8046961Z  2025-05-07T19:45:28.8047232Z 2025-05-07T19:45:28.8047235Z 2025-05-07T19:45:28.8047242Z 2025-05-07T19:45:28.8047246Z 2025-05-07T19:45:28.8047249Z 2025-05-07T19:45:28.8047253Z 2025-05-07T19:45:28.8047256Z 2025-05-07T19:45:28.8047260Z 2025-05-07T19:45:28.8047263Z 2025-05-07T19:45:28.8047267Z 2025-05-07T19:45:28.8047298Z 2025-05-07T19:45:28.8047302Z 2025-05-07T19:45:28.8047305Z 2025-05-07T19:45:28.8047309Z 2025-05-07T19:45:28.8047312Z 2025-05-07T19:45:28.8047316Z 2025-05-07T19:45:28.8047419Z 2025-05-07T19:45:28.8047423Z 2025-05-07T19:45:28.8047669Z  2025-05-07T19:45:28.8047947Z 2025-05-07T19:45:28.8047977Z 2025-05-07T19:45:28.8048095Z  2025-05-07T19:45:28.8048219Z 2025-05-07T19:45:28.8048222Z 2025-05-07T19:45:28.8048334Z  2025-05-07T19:45:28.8048488Z 2025-05-07T19:45:28.8048491Z 2025-05-07T19:45:28.8048495Z 2025-05-07T19:45:28.8048610Z  2025-05-07T19:45:28.8048734Z 2025-05-07T19:45:28.8048738Z 2025-05-07T19:45:28.8048742Z 2025-05-07T19:45:28.8048749Z 2025-05-07T19:45:28.8048899Z  2025-05-07T19:45:28.8049034Z 2025-05-07T19:45:28.8049037Z 2025-05-07T19:45:28.8049041Z 2025-05-07T19:45:28.8049044Z 2025-05-07T19:45:28.8049048Z 2025-05-07T19:45:28.8049170Z  2025-05-07T19:45:28.8049333Z 2025-05-07T19:45:28.8049337Z 2025-05-07T19:45:28.8049341Z 2025-05-07T19:45:28.8049345Z 2025-05-07T19:45:28.8049349Z 2025-05-07T19:45:28.8049356Z 2025-05-07T19:45:28.8049477Z  2025-05-07T19:45:28.8049652Z 2025-05-07T19:45:28.8049655Z 2025-05-07T19:45:28.8049659Z 2025-05-07T19:45:28.8049662Z 2025-05-07T19:45:28.8049666Z 2025-05-07T19:45:28.8049669Z 2025-05-07T19:45:28.8049672Z 2025-05-07T19:45:28.8049797Z  2025-05-07T19:45:28.8049960Z 2025-05-07T19:45:28.8049964Z 2025-05-07T19:45:28.8049991Z 2025-05-07T19:45:28.8049995Z 2025-05-07T19:45:28.8049998Z 2025-05-07T19:45:28.8050002Z 2025-05-07T19:45:28.8050005Z 2025-05-07T19:45:28.8050009Z 2025-05-07T19:45:28.8050199Z  2025-05-07T19:45:28.8050474Z 2025-05-07T19:45:28.8050477Z 2025-05-07T19:45:28.8050481Z 2025-05-07T19:45:28.8050484Z 2025-05-07T19:45:28.8050488Z 2025-05-07T19:45:28.8050491Z 2025-05-07T19:45:28.8050494Z 2025-05-07T19:45:28.8050498Z 2025-05-07T19:45:28.8050501Z 2025-05-07T19:45:28.8050637Z  2025-05-07T19:45:28.8050837Z 2025-05-07T19:45:28.8050840Z 2025-05-07T19:45:28.8050848Z 2025-05-07T19:45:28.8050851Z 2025-05-07T19:45:28.8050855Z 2025-05-07T19:45:28.8050858Z 2025-05-07T19:45:28.8050861Z 2025-05-07T19:45:28.8050865Z 2025-05-07T19:45:28.8050868Z 2025-05-07T19:45:28.8050872Z 2025-05-07T19:45:28.8051020Z  2025-05-07T19:45:28.8051228Z 2025-05-07T19:45:28.8051231Z 2025-05-07T19:45:28.8051235Z 2025-05-07T19:45:28.8051239Z 2025-05-07T19:45:28.8051243Z 2025-05-07T19:45:28.8051246Z 2025-05-07T19:45:28.8051250Z 2025-05-07T19:45:28.8051253Z 2025-05-07T19:45:28.8051256Z 2025-05-07T19:45:28.8051260Z 2025-05-07T19:45:28.8051267Z 2025-05-07T19:45:28.8051409Z  2025-05-07T19:45:28.8051626Z 2025-05-07T19:45:28.8051630Z 2025-05-07T19:45:28.8051633Z 2025-05-07T19:45:28.8051637Z 2025-05-07T19:45:28.8051640Z 2025-05-07T19:45:28.8051644Z 2025-05-07T19:45:28.8051647Z 2025-05-07T19:45:28.8051651Z 2025-05-07T19:45:28.8051654Z 2025-05-07T19:45:28.8051657Z 2025-05-07T19:45:28.8051661Z 2025-05-07T19:45:28.8051667Z 2025-05-07T19:45:28.8051813Z  2025-05-07T19:45:28.8052050Z 2025-05-07T19:45:28.8052053Z 2025-05-07T19:45:28.8052057Z 2025-05-07T19:45:28.8052061Z 2025-05-07T19:45:28.8052064Z 2025-05-07T19:45:28.8052068Z 2025-05-07T19:45:28.8052071Z 2025-05-07T19:45:28.8052075Z 2025-05-07T19:45:28.8052078Z 2025-05-07T19:45:28.8052082Z 2025-05-07T19:45:28.8052085Z 2025-05-07T19:45:28.8052088Z 2025-05-07T19:45:28.8052092Z 2025-05-07T19:45:28.8052268Z  2025-05-07T19:45:28.8052476Z 2025-05-07T19:45:28.8052480Z 2025-05-07T19:45:28.8052487Z 2025-05-07T19:45:28.8052491Z 2025-05-07T19:45:28.8052495Z 2025-05-07T19:45:28.8052499Z 2025-05-07T19:45:28.8052502Z 2025-05-07T19:45:28.8052507Z 2025-05-07T19:45:28.8052510Z 2025-05-07T19:45:28.8052513Z 2025-05-07T19:45:28.8052517Z 2025-05-07T19:45:28.8052520Z 2025-05-07T19:45:28.8052524Z 2025-05-07T19:45:28.8052527Z 2025-05-07T19:45:28.8052712Z  2025-05-07T19:45:28.8052985Z 2025-05-07T19:45:28.8052989Z 2025-05-07T19:45:28.8052993Z 2025-05-07T19:45:28.8052996Z 2025-05-07T19:45:28.8052999Z 2025-05-07T19:45:28.8053003Z 2025-05-07T19:45:28.8053006Z 2025-05-07T19:45:28.8053010Z 2025-05-07T19:45:28.8053014Z 2025-05-07T19:45:28.8053018Z 2025-05-07T19:45:28.8053021Z 2025-05-07T19:45:28.8053025Z 2025-05-07T19:45:28.8053052Z 2025-05-07T19:45:28.8053056Z 2025-05-07T19:45:28.8053059Z 2025-05-07T19:45:28.8053219Z  2025-05-07T19:45:28.8053438Z 2025-05-07T19:45:28.8053442Z 2025-05-07T19:45:28.8053449Z 2025-05-07T19:45:28.8053452Z 2025-05-07T19:45:28.8053456Z 2025-05-07T19:45:28.8053459Z 2025-05-07T19:45:28.8053463Z 2025-05-07T19:45:28.8053489Z 2025-05-07T19:45:28.8053493Z 2025-05-07T19:45:28.8053496Z 2025-05-07T19:45:28.8053500Z 2025-05-07T19:45:28.8053503Z 2025-05-07T19:45:28.8053506Z 2025-05-07T19:45:28.8053510Z 2025-05-07T19:45:28.8053513Z 2025-05-07T19:45:28.8053517Z 2025-05-07T19:45:28.8053689Z  2025-05-07T19:45:28.8053916Z 2025-05-07T19:45:28.8053919Z 2025-05-07T19:45:28.8053948Z 2025-05-07T19:45:28.8053952Z 2025-05-07T19:45:28.8053955Z 2025-05-07T19:45:28.8053959Z 2025-05-07T19:45:28.8053962Z 2025-05-07T19:45:28.8053966Z 2025-05-07T19:45:28.8053969Z 2025-05-07T19:45:28.8053973Z 2025-05-07T19:45:28.8053976Z 2025-05-07T19:45:28.8053980Z 2025-05-07T19:45:28.8053983Z 2025-05-07T19:45:28.8053987Z 2025-05-07T19:45:28.8053990Z 2025-05-07T19:45:28.8053994Z 2025-05-07T19:45:28.8053997Z 2025-05-07T19:45:28.8054222Z  2025-05-07T19:45:28.8054480Z 2025-05-07T19:45:28.8054483Z 2025-05-07T19:45:28.8054487Z 2025-05-07T19:45:28.8054491Z 2025-05-07T19:45:28.8054494Z 2025-05-07T19:45:28.8054498Z 2025-05-07T19:45:28.8054501Z 2025-05-07T19:45:28.8054504Z 2025-05-07T19:45:28.8054508Z 2025-05-07T19:45:28.8054511Z 2025-05-07T19:45:28.8054515Z 2025-05-07T19:45:28.8054518Z 2025-05-07T19:45:28.8054525Z 2025-05-07T19:45:28.8054528Z 2025-05-07T19:45:28.8054532Z 2025-05-07T19:45:28.8054535Z 2025-05-07T19:45:28.8054539Z 2025-05-07T19:45:28.8054542Z 2025-05-07T19:45:28.8054755Z  2025-05-07T19:45:28.8054992Z 2025-05-07T19:45:28.8054996Z 2025-05-07T19:45:28.8055100Z  2025-05-07T19:45:28.8055241Z 2025-05-07T19:45:28.8055244Z 2025-05-07T19:45:28.8055354Z  2025-05-07T19:45:28.8055473Z 2025-05-07T19:45:28.8055476Z 2025-05-07T19:45:28.8055480Z 2025-05-07T19:45:28.8055616Z  2025-05-07T19:45:28.8055738Z 2025-05-07T19:45:28.8055745Z 2025-05-07T19:45:28.8055749Z 2025-05-07T19:45:28.8055753Z 2025-05-07T19:45:28.8055868Z  2025-05-07T19:45:28.8056036Z 2025-05-07T19:45:28.8056040Z 2025-05-07T19:45:28.8056043Z 2025-05-07T19:45:28.8056047Z 2025-05-07T19:45:28.8056051Z 2025-05-07T19:45:28.8056169Z  2025-05-07T19:45:28.8056306Z 2025-05-07T19:45:28.8056310Z 2025-05-07T19:45:28.8056313Z 2025-05-07T19:45:28.8056321Z 2025-05-07T19:45:28.8056350Z 2025-05-07T19:45:28.8056354Z 2025-05-07T19:45:28.8056474Z  2025-05-07T19:45:28.8056617Z 2025-05-07T19:45:28.8056620Z 2025-05-07T19:45:28.8056624Z 2025-05-07T19:45:28.8056627Z 2025-05-07T19:45:28.8056631Z 2025-05-07T19:45:28.8056635Z 2025-05-07T19:45:28.8056638Z 2025-05-07T19:45:28.8056787Z  2025-05-07T19:45:28.8056941Z 2025-05-07T19:45:28.8056945Z 2025-05-07T19:45:28.8056948Z 2025-05-07T19:45:28.8056952Z 2025-05-07T19:45:28.8056955Z 2025-05-07T19:45:28.8056958Z 2025-05-07T19:45:28.8056962Z 2025-05-07T19:45:28.8056968Z 2025-05-07T19:45:28.8057125Z  2025-05-07T19:45:28.8057290Z 2025-05-07T19:45:28.8057293Z 2025-05-07T19:45:28.8057297Z 2025-05-07T19:45:28.8057301Z 2025-05-07T19:45:28.8057304Z 2025-05-07T19:45:28.8057308Z 2025-05-07T19:45:28.8057311Z 2025-05-07T19:45:28.8057314Z 2025-05-07T19:45:28.8057318Z 2025-05-07T19:45:28.8057451Z  2025-05-07T19:45:28.8057823Z 2025-05-07T19:45:28.8057827Z 2025-05-07T19:45:28.8057830Z 2025-05-07T19:45:28.8057834Z 2025-05-07T19:45:28.8057837Z 2025-05-07T19:45:28.8057842Z 2025-05-07T19:45:28.8057845Z 2025-05-07T19:45:28.8057849Z 2025-05-07T19:45:28.8057853Z 2025-05-07T19:45:28.8057857Z 2025-05-07T19:45:28.8057999Z  2025-05-07T19:45:28.8058218Z 2025-05-07T19:45:28.8058222Z 2025-05-07T19:45:28.8058225Z 2025-05-07T19:45:28.8058228Z 2025-05-07T19:45:28.8058232Z 2025-05-07T19:45:28.8058235Z 2025-05-07T19:45:28.8058239Z 2025-05-07T19:45:28.8058242Z 2025-05-07T19:45:28.8058249Z 2025-05-07T19:45:28.8058252Z 2025-05-07T19:45:28.8058256Z 2025-05-07T19:45:28.8058397Z  2025-05-07T19:45:28.8058614Z 2025-05-07T19:45:28.8058617Z 2025-05-07T19:45:28.8058621Z 2025-05-07T19:45:28.8058625Z 2025-05-07T19:45:28.8058628Z 2025-05-07T19:45:28.8058631Z 2025-05-07T19:45:28.8058635Z 2025-05-07T19:45:28.8058639Z 2025-05-07T19:45:28.8058642Z 2025-05-07T19:45:28.8058649Z 2025-05-07T19:45:28.8058652Z 2025-05-07T19:45:28.8058656Z 2025-05-07T19:45:28.8058829Z  2025-05-07T19:45:28.8059031Z 2025-05-07T19:45:28.8059034Z 2025-05-07T19:45:28.8059038Z 2025-05-07T19:45:28.8059041Z 2025-05-07T19:45:28.8059044Z 2025-05-07T19:45:28.8059048Z 2025-05-07T19:45:28.8059051Z 2025-05-07T19:45:28.8059055Z 2025-05-07T19:45:28.8059058Z 2025-05-07T19:45:28.8059062Z 2025-05-07T19:45:28.8059065Z 2025-05-07T19:45:28.8059068Z 2025-05-07T19:45:28.8059072Z 2025-05-07T19:45:28.8059248Z  2025-05-07T19:45:28.8059523Z 2025-05-07T19:45:28.8059527Z 2025-05-07T19:45:28.8059531Z 2025-05-07T19:45:28.8059534Z 2025-05-07T19:45:28.8059538Z 2025-05-07T19:45:28.8059541Z 2025-05-07T19:45:28.8059545Z 2025-05-07T19:45:28.8059548Z 2025-05-07T19:45:28.8059551Z 2025-05-07T19:45:28.8059555Z 2025-05-07T19:45:28.8059559Z 2025-05-07T19:45:28.8059562Z 2025-05-07T19:45:28.8059565Z 2025-05-07T19:45:28.8059597Z 2025-05-07T19:45:28.8059754Z  2025-05-07T19:45:28.8059970Z 2025-05-07T19:45:28.8059974Z 2025-05-07T19:45:28.8059977Z 2025-05-07T19:45:28.8059980Z 2025-05-07T19:45:28.8059984Z 2025-05-07T19:45:28.8059987Z 2025-05-07T19:45:28.8059991Z 2025-05-07T19:45:28.8059994Z 2025-05-07T19:45:28.8059998Z 2025-05-07T19:45:28.8060001Z 2025-05-07T19:45:28.8060032Z 2025-05-07T19:45:28.8060035Z 2025-05-07T19:45:28.8060039Z 2025-05-07T19:45:28.8060043Z 2025-05-07T19:45:28.8060046Z 2025-05-07T19:45:28.8060208Z  2025-05-07T19:45:28.8060430Z 2025-05-07T19:45:28.8060434Z 2025-05-07T19:45:28.8060437Z 2025-05-07T19:45:28.8060441Z 2025-05-07T19:45:28.8060444Z 2025-05-07T19:45:28.8060448Z 2025-05-07T19:45:28.8060475Z 2025-05-07T19:45:28.8060478Z 2025-05-07T19:45:28.8060482Z 2025-05-07T19:45:28.8060485Z 2025-05-07T19:45:28.8060488Z 2025-05-07T19:45:28.8060492Z 2025-05-07T19:45:28.8060495Z 2025-05-07T19:45:28.8060498Z 2025-05-07T19:45:28.8060505Z 2025-05-07T19:45:28.8060509Z 2025-05-07T19:45:28.8060674Z  2025-05-07T19:45:28.8060926Z 2025-05-07T19:45:28.8060929Z 2025-05-07T19:45:28.8060933Z 2025-05-07T19:45:28.8060936Z 2025-05-07T19:45:28.8060940Z 2025-05-07T19:45:28.8060943Z 2025-05-07T19:45:28.8060947Z 2025-05-07T19:45:28.8060950Z 2025-05-07T19:45:28.8060954Z 2025-05-07T19:45:28.8060957Z 2025-05-07T19:45:28.8060960Z 2025-05-07T19:45:28.8060964Z 2025-05-07T19:45:28.8060967Z 2025-05-07T19:45:28.8060970Z 2025-05-07T19:45:28.8060974Z 2025-05-07T19:45:28.8060981Z 2025-05-07T19:45:28.8060985Z 2025-05-07T19:45:28.8061158Z  2025-05-07T19:45:28.8061415Z 2025-05-07T19:45:28.8061418Z 2025-05-07T19:45:28.8061422Z 2025-05-07T19:45:28.8061425Z 2025-05-07T19:45:28.8061429Z 2025-05-07T19:45:28.8061432Z 2025-05-07T19:45:28.8061436Z 2025-05-07T19:45:28.8061440Z 2025-05-07T19:45:28.8061443Z 2025-05-07T19:45:28.8061505Z 2025-05-07T19:45:28.8061508Z 2025-05-07T19:45:28.8061512Z 2025-05-07T19:45:28.8061515Z 2025-05-07T19:45:28.8061519Z 2025-05-07T19:45:28.8061522Z 2025-05-07T19:45:28.8061525Z 2025-05-07T19:45:28.8061529Z 2025-05-07T19:45:28.8061532Z 2025-05-07T19:45:28.8061735Z  2025-05-07T19:45:28.8061972Z 2025-05-07T19:45:28.8061976Z 2025-05-07T19:45:28.8062086Z  2025-05-07T19:45:28.8062236Z 2025-05-07T19:45:28.8062240Z 2025-05-07T19:45:28.8062347Z  2025-05-07T19:45:28.8062469Z 2025-05-07T19:45:28.8062473Z 2025-05-07T19:45:28.8062477Z 2025-05-07T19:45:28.8062625Z  2025-05-07T19:45:28.8062746Z 2025-05-07T19:45:28.8062750Z 2025-05-07T19:45:28.8062754Z 2025-05-07T19:45:28.8062757Z 2025-05-07T19:45:28.8062873Z  2025-05-07T19:45:28.8063028Z 2025-05-07T19:45:28.8063031Z 2025-05-07T19:45:28.8063035Z 2025-05-07T19:45:28.8063038Z 2025-05-07T19:45:28.8063041Z 2025-05-07T19:45:28.8063161Z  2025-05-07T19:45:28.8063302Z 2025-05-07T19:45:28.8063306Z 2025-05-07T19:45:28.8063333Z 2025-05-07T19:45:28.8063336Z 2025-05-07T19:45:28.8063340Z 2025-05-07T19:45:28.8063343Z 2025-05-07T19:45:28.8063467Z  2025-05-07T19:45:28.8063607Z 2025-05-07T19:45:28.8063610Z 2025-05-07T19:45:28.8063614Z 2025-05-07T19:45:28.8063617Z 2025-05-07T19:45:28.8063621Z 2025-05-07T19:45:28.8063624Z 2025-05-07T19:45:28.8063628Z 2025-05-07T19:45:28.8063779Z  2025-05-07T19:45:28.8063932Z 2025-05-07T19:45:28.8063935Z 2025-05-07T19:45:28.8063939Z 2025-05-07T19:45:28.8063942Z 2025-05-07T19:45:28.8064007Z 2025-05-07T19:45:28.8064011Z 2025-05-07T19:45:28.8064015Z 2025-05-07T19:45:28.8064018Z 2025-05-07T19:45:28.8064173Z  2025-05-07T19:45:28.8064338Z 2025-05-07T19:45:28.8064342Z 2025-05-07T19:45:28.8064346Z 2025-05-07T19:45:28.8064349Z 2025-05-07T19:45:28.8064352Z 2025-05-07T19:45:28.8064356Z 2025-05-07T19:45:28.8064359Z 2025-05-07T19:45:28.8064363Z 2025-05-07T19:45:28.8064370Z 2025-05-07T19:45:28.8064529Z  2025-05-07T19:45:28.8064699Z 2025-05-07T19:45:28.8064702Z 2025-05-07T19:45:28.8064706Z 2025-05-07T19:45:28.8064709Z 2025-05-07T19:45:28.8064713Z 2025-05-07T19:45:28.8064716Z 2025-05-07T19:45:28.8064720Z 2025-05-07T19:45:28.8064723Z 2025-05-07T19:45:28.8064727Z 2025-05-07T19:45:28.8064730Z 2025-05-07T19:45:28.8064867Z  2025-05-07T19:45:28.8065069Z 2025-05-07T19:45:28.8065073Z 2025-05-07T19:45:28.8065077Z 2025-05-07T19:45:28.8065081Z 2025-05-07T19:45:28.8065084Z 2025-05-07T19:45:28.8065088Z 2025-05-07T19:45:28.8065094Z 2025-05-07T19:45:28.8065098Z 2025-05-07T19:45:28.8065101Z 2025-05-07T19:45:28.8065104Z 2025-05-07T19:45:28.8065108Z 2025-05-07T19:45:28.8065274Z  2025-05-07T19:45:28.8065462Z 2025-05-07T19:45:28.8065466Z 2025-05-07T19:45:28.8065470Z 2025-05-07T19:45:28.8065473Z 2025-05-07T19:45:28.8065476Z 2025-05-07T19:45:28.8065480Z 2025-05-07T19:45:28.8065487Z 2025-05-07T19:45:28.8065490Z 2025-05-07T19:45:28.8065494Z 2025-05-07T19:45:28.8065497Z 2025-05-07T19:45:28.8065501Z 2025-05-07T19:45:28.8065504Z 2025-05-07T19:45:28.8065670Z  2025-05-07T19:45:28.8065868Z 2025-05-07T19:45:28.8065872Z 2025-05-07T19:45:28.8065875Z 2025-05-07T19:45:28.8065879Z 2025-05-07T19:45:28.8065882Z 2025-05-07T19:45:28.8065886Z 2025-05-07T19:45:28.8065889Z 2025-05-07T19:45:28.8065892Z 2025-05-07T19:45:28.8065896Z 2025-05-07T19:45:28.8065899Z 2025-05-07T19:45:28.8065902Z 2025-05-07T19:45:28.8065906Z 2025-05-07T19:45:28.8065909Z 2025-05-07T19:45:28.8066086Z  2025-05-07T19:45:28.8066293Z 2025-05-07T19:45:28.8066296Z 2025-05-07T19:45:28.8066299Z 2025-05-07T19:45:28.8066303Z 2025-05-07T19:45:28.8066306Z 2025-05-07T19:45:28.8066310Z 2025-05-07T19:45:28.8066313Z 2025-05-07T19:45:28.8066317Z 2025-05-07T19:45:28.8066320Z 2025-05-07T19:45:28.8066324Z 2025-05-07T19:45:28.8066328Z 2025-05-07T19:45:28.8066401Z 2025-05-07T19:45:28.8066404Z 2025-05-07T19:45:28.8066434Z 2025-05-07T19:45:28.8066587Z  2025-05-07T19:45:28.8066799Z 2025-05-07T19:45:28.8066803Z 2025-05-07T19:45:28.8066806Z 2025-05-07T19:45:28.8066810Z 2025-05-07T19:45:28.8066813Z 2025-05-07T19:45:28.8066817Z 2025-05-07T19:45:28.8066820Z 2025-05-07T19:45:28.8066824Z 2025-05-07T19:45:28.8066828Z 2025-05-07T19:45:28.8066832Z 2025-05-07T19:45:28.8066858Z 2025-05-07T19:45:28.8066862Z 2025-05-07T19:45:28.8066865Z 2025-05-07T19:45:28.8066869Z 2025-05-07T19:45:28.8066872Z 2025-05-07T19:45:28.8067035Z  2025-05-07T19:45:28.8067254Z 2025-05-07T19:45:28.8067259Z 2025-05-07T19:45:28.8067263Z 2025-05-07T19:45:28.8067267Z 2025-05-07T19:45:28.8067271Z 2025-05-07T19:45:28.8067274Z 2025-05-07T19:45:28.8067301Z 2025-05-07T19:45:28.8067305Z 2025-05-07T19:45:28.8067308Z 2025-05-07T19:45:28.8067312Z 2025-05-07T19:45:28.8067315Z 2025-05-07T19:45:28.8067322Z 2025-05-07T19:45:28.8067325Z 2025-05-07T19:45:28.8067329Z 2025-05-07T19:45:28.8067333Z 2025-05-07T19:45:28.8067336Z 2025-05-07T19:45:28.8067498Z  2025-05-07T19:45:28.8067752Z 2025-05-07T19:45:28.8067755Z 2025-05-07T19:45:28.8067759Z 2025-05-07T19:45:28.8067762Z 2025-05-07T19:45:28.8067765Z 2025-05-07T19:45:28.8067769Z 2025-05-07T19:45:28.8067772Z 2025-05-07T19:45:28.8067776Z 2025-05-07T19:45:28.8067779Z 2025-05-07T19:45:28.8067783Z 2025-05-07T19:45:28.8067786Z 2025-05-07T19:45:28.8067790Z 2025-05-07T19:45:28.8067793Z 2025-05-07T19:45:28.8067854Z 2025-05-07T19:45:28.8067858Z 2025-05-07T19:45:28.8067862Z 2025-05-07T19:45:28.8067865Z 2025-05-07T19:45:28.8068038Z  2025-05-07T19:45:28.8068299Z 2025-05-07T19:45:28.8068303Z 2025-05-07T19:45:28.8068307Z 2025-05-07T19:45:28.8068310Z 2025-05-07T19:45:28.8068314Z 2025-05-07T19:45:28.8068317Z 2025-05-07T19:45:28.8068322Z 2025-05-07T19:45:28.8068328Z 2025-05-07T19:45:28.8068332Z 2025-05-07T19:45:28.8068335Z 2025-05-07T19:45:28.8068339Z 2025-05-07T19:45:28.8068342Z 2025-05-07T19:45:28.8068346Z 2025-05-07T19:45:28.8068350Z 2025-05-07T19:45:28.8068354Z 2025-05-07T19:45:28.8068357Z 2025-05-07T19:45:28.8068361Z 2025-05-07T19:45:28.8068388Z 2025-05-07T19:45:28.8068567Z  2025-05-07T19:45:28.8068805Z 2025-05-07T19:45:28.8068809Z 2025-05-07T19:45:28.8068916Z  2025-05-07T19:45:28.8069062Z 2025-05-07T19:45:28.8069065Z 2025-05-07T19:45:28.8069174Z  2025-05-07T19:45:28.8069295Z 2025-05-07T19:45:28.8069299Z 2025-05-07T19:45:28.8069302Z 2025-05-07T19:45:28.8069435Z  2025-05-07T19:45:28.8069558Z 2025-05-07T19:45:28.8069561Z 2025-05-07T19:45:28.8069565Z 2025-05-07T19:45:28.8069568Z 2025-05-07T19:45:28.8069797Z  2025-05-07T19:45:28.8069952Z 2025-05-07T19:45:28.8069955Z 2025-05-07T19:45:28.8069959Z 2025-05-07T19:45:28.8069963Z 2025-05-07T19:45:28.8069970Z 2025-05-07T19:45:28.8070085Z  2025-05-07T19:45:28.8070218Z 2025-05-07T19:45:28.8070222Z 2025-05-07T19:45:28.8070249Z 2025-05-07T19:45:28.8070252Z 2025-05-07T19:45:28.8070256Z 2025-05-07T19:45:28.8070259Z 2025-05-07T19:45:28.8070421Z  2025-05-07T19:45:28.8070581Z 2025-05-07T19:45:28.8070585Z 2025-05-07T19:45:28.8070588Z 2025-05-07T19:45:28.8070591Z 2025-05-07T19:45:28.8070595Z 2025-05-07T19:45:28.8070598Z 2025-05-07T19:45:28.8070601Z 2025-05-07T19:45:28.8070725Z  2025-05-07T19:45:28.8070899Z 2025-05-07T19:45:28.8070902Z 2025-05-07T19:45:28.8070909Z 2025-05-07T19:45:28.8070913Z 2025-05-07T19:45:28.8070916Z 2025-05-07T19:45:28.8070920Z 2025-05-07T19:45:28.8070923Z 2025-05-07T19:45:28.8070926Z 2025-05-07T19:45:28.8071050Z  2025-05-07T19:45:28.8071212Z 2025-05-07T19:45:28.8071239Z 2025-05-07T19:45:28.8071242Z 2025-05-07T19:45:28.8071246Z 2025-05-07T19:45:28.8071249Z 2025-05-07T19:45:28.8071252Z 2025-05-07T19:45:28.8071314Z 2025-05-07T19:45:28.8071317Z 2025-05-07T19:45:28.8071320Z 2025-05-07T19:45:28.8071452Z  2025-05-07T19:45:28.8071619Z 2025-05-07T19:45:28.8071623Z 2025-05-07T19:45:28.8071626Z 2025-05-07T19:45:28.8071654Z 2025-05-07T19:45:28.8071658Z 2025-05-07T19:45:28.8071661Z 2025-05-07T19:45:28.8071664Z 2025-05-07T19:45:28.8071667Z 2025-05-07T19:45:28.8071671Z 2025-05-07T19:45:28.8071674Z 2025-05-07T19:45:28.8071811Z  2025-05-07T19:45:28.8071986Z 2025-05-07T19:45:28.8071990Z 2025-05-07T19:45:28.8071993Z 2025-05-07T19:45:28.8071996Z 2025-05-07T19:45:28.8072028Z 2025-05-07T19:45:28.8072032Z 2025-05-07T19:45:28.8072035Z 2025-05-07T19:45:28.8072038Z 2025-05-07T19:45:28.8072042Z 2025-05-07T19:45:28.8072045Z 2025-05-07T19:45:28.8072048Z 2025-05-07T19:45:28.8072184Z  2025-05-07T19:45:28.8072370Z 2025-05-07T19:45:28.8072374Z 2025-05-07T19:45:28.8072377Z 2025-05-07T19:45:28.8072380Z 2025-05-07T19:45:28.8072411Z 2025-05-07T19:45:28.8072414Z 2025-05-07T19:45:28.8072417Z 2025-05-07T19:45:28.8072421Z 2025-05-07T19:45:28.8072424Z 2025-05-07T19:45:28.8072428Z 2025-05-07T19:45:28.8072431Z 2025-05-07T19:45:28.8072435Z 2025-05-07T19:45:28.8072698Z  2025-05-07T19:45:28.8072897Z 2025-05-07T19:45:28.8073078Z 2025-05-07T19:45:28.8073082Z 2025-05-07T19:45:28.8073109Z 2025-05-07T19:45:28.8073113Z 2025-05-07T19:45:28.8073116Z 2025-05-07T19:45:28.8073119Z 2025-05-07T19:45:28.8073123Z 2025-05-07T19:45:28.8073126Z 2025-05-07T19:45:28.8073129Z 2025-05-07T19:45:28.8073203Z 2025-05-07T19:45:28.8073208Z 2025-05-07T19:45:28.8073211Z 2025-05-07T19:45:28.8073361Z  2025-05-07T19:45:28.8073598Z 2025-05-07T19:45:28.8073602Z 2025-05-07T19:45:28.8073606Z 2025-05-07T19:45:28.8073609Z 2025-05-07T19:45:28.8073612Z 2025-05-07T19:45:28.8073616Z 2025-05-07T19:45:28.8073619Z 2025-05-07T19:45:28.8073623Z 2025-05-07T19:45:28.8073626Z 2025-05-07T19:45:28.8073633Z 2025-05-07T19:45:28.8073636Z 2025-05-07T19:45:28.8073640Z 2025-05-07T19:45:28.8073643Z 2025-05-07T19:45:28.8073647Z 2025-05-07T19:45:28.8073802Z  2025-05-07T19:45:28.8074074Z 2025-05-07T19:45:28.8074077Z 2025-05-07T19:45:28.8074081Z 2025-05-07T19:45:28.8074084Z 2025-05-07T19:45:28.8074088Z 2025-05-07T19:45:28.8074091Z 2025-05-07T19:45:28.8074095Z 2025-05-07T19:45:28.8074098Z 2025-05-07T19:45:28.8074102Z 2025-05-07T19:45:28.8074105Z 2025-05-07T19:45:28.8074109Z 2025-05-07T19:45:28.8074112Z 2025-05-07T19:45:28.8074116Z 2025-05-07T19:45:28.8074122Z 2025-05-07T19:45:28.8074126Z 2025-05-07T19:45:28.8074313Z  2025-05-07T19:45:28.8074534Z 2025-05-07T19:45:28.8074538Z 2025-05-07T19:45:28.8074541Z 2025-05-07T19:45:28.8074545Z 2025-05-07T19:45:28.8074548Z 2025-05-07T19:45:28.8074552Z 2025-05-07T19:45:28.8074555Z 2025-05-07T19:45:28.8074558Z 2025-05-07T19:45:28.8074563Z 2025-05-07T19:45:28.8074569Z 2025-05-07T19:45:28.8074573Z 2025-05-07T19:45:28.8074576Z 2025-05-07T19:45:28.8074580Z 2025-05-07T19:45:28.8074583Z 2025-05-07T19:45:28.8074586Z 2025-05-07T19:45:28.8074590Z 2025-05-07T19:45:28.8074784Z  2025-05-07T19:45:28.8075010Z 2025-05-07T19:45:28.8075014Z 2025-05-07T19:45:28.8075018Z 2025-05-07T19:45:28.8075021Z 2025-05-07T19:45:28.8075025Z 2025-05-07T19:45:28.8075028Z 2025-05-07T19:45:28.8075032Z 2025-05-07T19:45:28.8075035Z 2025-05-07T19:45:28.8075039Z 2025-05-07T19:45:28.8075042Z 2025-05-07T19:45:28.8075046Z 2025-05-07T19:45:28.8075080Z 2025-05-07T19:45:28.8075083Z 2025-05-07T19:45:28.8075088Z 2025-05-07T19:45:28.8075091Z 2025-05-07T19:45:28.8075095Z 2025-05-07T19:45:28.8075098Z 2025-05-07T19:45:28.8075267Z  2025-05-07T19:45:28.8075500Z 2025-05-07T19:45:28.8075504Z 2025-05-07T19:45:28.8075507Z 2025-05-07T19:45:28.8075510Z 2025-05-07T19:45:28.8075542Z 2025-05-07T19:45:28.8075603Z 2025-05-07T19:45:28.8075606Z 2025-05-07T19:45:28.8075610Z 2025-05-07T19:45:28.8075613Z 2025-05-07T19:45:28.8075617Z 2025-05-07T19:45:28.8075620Z 2025-05-07T19:45:28.8075624Z 2025-05-07T19:45:28.8075627Z 2025-05-07T19:45:28.8075630Z 2025-05-07T19:45:28.8075633Z 2025-05-07T19:45:28.8075637Z 2025-05-07T19:45:28.8075640Z 2025-05-07T19:45:28.8075644Z 2025-05-07T19:45:28.8075820Z  2025-05-07T19:45:28.8076079Z 2025-05-07T19:45:28.8076083Z 2025-05-07T19:45:28.8076195Z  2025-05-07T19:45:28.8076314Z 2025-05-07T19:45:28.8076321Z 2025-05-07T19:45:28.8076455Z  2025-05-07T19:45:28.8076579Z 2025-05-07T19:45:28.8076583Z 2025-05-07T19:45:28.8076586Z 2025-05-07T19:45:28.8076704Z  done 2025-05-07T19:45:29.1212136Z Preparing transaction: | / - done 2025-05-07T19:45:32.6823353Z Verifying transaction: | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | done 2025-05-07T19:45:35.3059670Z Executing transaction: - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ done 2025-05-07T19:45:35.7207201Z [INSTALL] Adding symlink librhash.so.0, which is needed by CMake ... 2025-05-07T19:45:37.5706577Z + ln -s /github/home/miniconda/envs/build_binary/lib/librhash.so /github/home/miniconda/envs/build_binary/lib/librhash.so.0 2025-05-07T19:45:37.5707186Z 2025-05-07T19:45:37.5716889Z 2025-05-07T19:45:37.5741027Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install build 2025-05-07T19:45:39.9702165Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:45:39.9703742Z 2025-05-07T19:45:39.9703897Z Collecting build 2025-05-07T19:45:39.9704281Z Downloading build-1.2.2.post1-py3-none-any.whl.metadata (6.5 kB) 2025-05-07T19:45:39.9705175Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build) (25.0) 2025-05-07T19:45:39.9705941Z Collecting pyproject_hooks (from build) 2025-05-07T19:45:39.9706432Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl.metadata (1.3 kB) 2025-05-07T19:45:39.9707290Z Requirement already satisfied: tomli>=1.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build) (2.2.1) 2025-05-07T19:45:39.9708106Z Downloading build-1.2.2.post1-py3-none-any.whl (22 kB) 2025-05-07T19:45:39.9708611Z Downloading pyproject_hooks-1.2.0-py3-none-any.whl (10 kB) 2025-05-07T19:45:39.9709077Z Installing collected packages: pyproject_hooks, build 2025-05-07T19:45:39.9709355Z 2025-05-07T19:45:39.9709597Z Successfully installed build-1.2.2.post1 pyproject_hooks-1.2.0 2025-05-07T19:45:39.9709929Z 2025-05-07T19:45:41.8703835Z /github/home/miniconda/envs/build_binary/bin/make 2025-05-07T19:45:41.8704352Z 2025-05-07T19:45:41.9334548Z [CHECK] Binary make found in PATH 2025-05-07T19:45:43.7260577Z /github/home/miniconda/envs/build_binary/bin/cmake 2025-05-07T19:45:43.7261397Z 2025-05-07T19:45:43.7832311Z [CHECK] Binary cmake found in PATH 2025-05-07T19:45:45.5735546Z /github/home/miniconda/envs/build_binary/bin/ninja 2025-05-07T19:45:45.5735971Z 2025-05-07T19:45:45.6314395Z [CHECK] Binary ninja found in PATH 2025-05-07T19:45:47.5195408Z [CHECK] Python (sub-)package 'click' found ... 2025-05-07T19:45:49.5440844Z [CHECK] Python (sub-)package 'hypothesis' found ... 2025-05-07T19:45:51.4742205Z [CHECK] Python (sub-)package 'jinja2' found ... 2025-05-07T19:45:53.4956925Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:45:55.3586068Z [CHECK] Python (sub-)package 'wheel' found ... 2025-05-07T19:45:55.3587320Z [INSTALL] Successfully installed all the build tools 2025-05-07T19:45:55.3662994Z ##[group]Run . $PRELUDE; install_cuda $BUILD_ENV 11.8.0 2025-05-07T19:45:55.3663520Z . $PRELUDE; install_cuda $BUILD_ENV 11.8.0 2025-05-07T19:45:55.3664249Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:45:55.3664625Z env: 2025-05-07T19:45:55.3664879Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:45:55.3665247Z BUILD_ENV: build_binary 2025-05-07T19:45:55.3665537Z BUILD_TARGET: default 2025-05-07T19:45:55.3665824Z BUILD_VARIANT: cuda 2025-05-07T19:45:55.3666078Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:45:55.3666374Z ##[endgroup] 2025-05-07T19:45:55.7812123Z ################################################################################ 2025-05-07T19:45:55.7813196Z # Install CUDA 2025-05-07T19:45:55.7813802Z # 2025-05-07T19:45:55.7836086Z # [2025-05-07T19:45:55.782Z] + install_cuda build_binary 11.8.0 2025-05-07T19:45:55.7836566Z ################################################################################ 2025-05-07T19:45:55.7836908Z 2025-05-07T19:45:55.7859273Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:45:55.8712931Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:45:55.8713705Z [SETUP] Cleaning up Conda packages ... 2025-05-07T19:45:55.8714127Z + conda clean --packages --tarball -y 2025-05-07T19:45:55.8714354Z 2025-05-07T19:45:56.4066598Z Will remove 147 (616.0 MB) tarball(s). 2025-05-07T19:45:56.4067565Z Will remove 21 (80.4 MB) package(s). 2025-05-07T19:45:56.4628266Z 2025-05-07T19:45:56.4633321Z + conda clean --all -y 2025-05-07T19:45:56.4633863Z 2025-05-07T19:45:57.0595181Z There are no unused tarball(s) to remove. 2025-05-07T19:45:57.0596207Z Will remove 1 index cache(s). 2025-05-07T19:45:57.0597101Z There are no unused package(s) to remove. 2025-05-07T19:45:57.0597927Z There are no tempfile(s) to remove. 2025-05-07T19:45:57.0598270Z There are no logfile(s) to remove. 2025-05-07T19:45:57.1150192Z 2025-05-07T19:45:57.1158009Z [INSTALL] Installing CUDA 11.8.0 ... 2025-05-07T19:45:57.1185116Z [EXEC] [ATTEMPT 0/3] + conda install --force-reinstall -n build_binary -c nvidia/label/cuda-11.8.0 -y cuda 2025-05-07T19:45:58.1435797Z Channels: 2025-05-07T19:45:58.1436549Z - nvidia/label/cuda-11.8.0 2025-05-07T19:45:58.1437383Z - defaults 2025-05-07T19:45:58.1437645Z Platform: linux-64 2025-05-07T19:45:59.2938457Z Collecting package metadata (repodata.json): - \ | / - \ | done 2025-05-07T19:45:59.5046045Z Solving environment: - \ done 2025-05-07T19:45:59.6283353Z 2025-05-07T19:45:59.6283910Z ## Package Plan ## 2025-05-07T19:45:59.6284388Z 2025-05-07T19:45:59.6285225Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:45:59.6285576Z 2025-05-07T19:45:59.6285691Z added / updated specs: 2025-05-07T19:45:59.6285988Z - cuda 2025-05-07T19:45:59.6286116Z 2025-05-07T19:45:59.6286120Z 2025-05-07T19:45:59.6286257Z The following packages will be downloaded: 2025-05-07T19:45:59.6286496Z 2025-05-07T19:45:59.6286671Z package | build 2025-05-07T19:45:59.6287035Z ---------------------------|----------------- 2025-05-07T19:45:59.6287477Z cuda-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6287985Z cuda-cccl-11.8.89 | 0 1.2 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6288542Z cuda-command-line-tools-11.8.0| 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6289164Z cuda-compiler-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6289685Z cuda-cudart-11.8.89 | 0 197 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6290236Z cuda-cudart-dev-11.8.89 | 0 1.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6290781Z cuda-cuobjdump-11.8.86 | 0 229 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6291643Z cuda-cupti-11.8.87 | 0 25.3 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6292351Z cuda-cuxxfilt-11.8.86 | 0 291 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6292890Z cuda-demo-suite-11.8.86 | 0 5.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6293484Z cuda-documentation-11.8.86 | 0 89 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6294048Z cuda-driver-dev-11.8.89 | 0 16 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6311918Z cuda-gdb-11.8.86 | 0 4.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6312581Z cuda-libraries-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6313349Z cuda-libraries-dev-11.8.0 | 0 2 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6313944Z cuda-memcheck-11.8.86 | 0 168 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6314519Z cuda-nsight-11.8.86 | 0 113.6 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6315090Z cuda-nsight-compute-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6315670Z cuda-nvcc-11.8.89 | 0 50.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6316204Z cuda-nvdisasm-11.8.86 | 0 48.7 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6316724Z cuda-nvml-dev-11.8.86 | 0 83 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6317263Z cuda-nvprof-11.8.87 | 0 4.4 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6317786Z cuda-nvprune-11.8.86 | 0 65 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6318329Z cuda-nvrtc-11.8.89 | 0 19.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6318879Z cuda-nvrtc-dev-11.8.89 | 0 17.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6319499Z cuda-nvtx-11.8.86 | 0 57 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6319975Z cuda-nvvp-11.8.87 | 0 114.4 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6320460Z cuda-profiler-api-11.8.86 | 0 18 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6320985Z cuda-runtime-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6321491Z cuda-sanitizer-api-11.8.86 | 0 16.6 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6322018Z cuda-toolkit-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6322510Z cuda-tools-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6323002Z cuda-visual-tools-11.8.0 | 0 1 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6323524Z gds-tools-1.4.0.31 | 0 2 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6324003Z libcublas-11.11.3.6 | 0 364.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6324555Z libcublas-dev-11.11.3.6 | 0 394.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6325064Z libcufft-10.9.0.58 | 0 142.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6325522Z libcufft-dev-10.9.0.58 | 0 275.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6326024Z libcufile-1.4.0.31 | 0 548 KB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6326519Z libcufile-dev-1.4.0.31 | 0 1.6 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6327001Z libcurand-10.3.0.86 | 0 53.2 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6327702Z libcurand-dev-10.3.0.86 | 0 53.7 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6328279Z libcusolver-11.4.1.48 | 0 96.5 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6328799Z libcusolver-dev-11.4.1.48 | 0 66.3 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6329316Z libcusparse-11.7.5.86 | 0 176.3 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6329817Z libcusparse-dev-11.7.5.86 | 0 359.7 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6330319Z libnpp-11.8.0.86 | 0 147.8 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6330782Z libnpp-dev-11.8.0.86 | 0 144.5 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6331670Z libnvjpeg-11.9.0.86 | 0 2.4 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6332229Z libnvjpeg-dev-11.9.0.86 | 0 2.1 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6332797Z nsight-compute-2022.3.0.22 | 0 610.0 MB nvidia/label/cuda-11.8.0 2025-05-07T19:45:59.6333318Z ------------------------------------------------------------ 2025-05-07T19:45:59.6333692Z Total: 3.24 GB 2025-05-07T19:45:59.6333955Z 2025-05-07T19:45:59.6334101Z The following NEW packages will be INSTALLED: 2025-05-07T19:45:59.6334351Z 2025-05-07T19:45:59.6334580Z cuda nvidia/label/cuda-11.8.0/linux-64::cuda-11.8.0-0 2025-05-07T19:45:59.6335076Z cuda-cccl nvidia/label/cuda-11.8.0/linux-64::cuda-cccl-11.8.89-0 2025-05-07T19:45:59.6335737Z cuda-command-line~ nvidia/label/cuda-11.8.0/linux-64::cuda-command-line-tools-11.8.0-0 2025-05-07T19:45:59.6336416Z cuda-compiler nvidia/label/cuda-11.8.0/linux-64::cuda-compiler-11.8.0-0 2025-05-07T19:45:59.6337033Z cuda-cudart nvidia/label/cuda-11.8.0/linux-64::cuda-cudart-11.8.89-0 2025-05-07T19:45:59.6337665Z cuda-cudart-dev nvidia/label/cuda-11.8.0/linux-64::cuda-cudart-dev-11.8.89-0 2025-05-07T19:45:59.6338311Z cuda-cuobjdump nvidia/label/cuda-11.8.0/linux-64::cuda-cuobjdump-11.8.86-0 2025-05-07T19:45:59.6338932Z cuda-cupti nvidia/label/cuda-11.8.0/linux-64::cuda-cupti-11.8.87-0 2025-05-07T19:45:59.6339526Z cuda-cuxxfilt nvidia/label/cuda-11.8.0/linux-64::cuda-cuxxfilt-11.8.86-0 2025-05-07T19:45:59.6340169Z cuda-demo-suite nvidia/label/cuda-11.8.0/linux-64::cuda-demo-suite-11.8.86-0 2025-05-07T19:45:59.6340852Z cuda-documentation nvidia/label/cuda-11.8.0/linux-64::cuda-documentation-11.8.86-0 2025-05-07T19:45:59.6341535Z cuda-driver-dev nvidia/label/cuda-11.8.0/linux-64::cuda-driver-dev-11.8.89-0 2025-05-07T19:45:59.6342122Z cuda-gdb nvidia/label/cuda-11.8.0/linux-64::cuda-gdb-11.8.86-0 2025-05-07T19:45:59.6342729Z cuda-libraries nvidia/label/cuda-11.8.0/linux-64::cuda-libraries-11.8.0-0 2025-05-07T19:45:59.6343420Z cuda-libraries-dev nvidia/label/cuda-11.8.0/linux-64::cuda-libraries-dev-11.8.0-0 2025-05-07T19:45:59.6344193Z cuda-memcheck nvidia/label/cuda-11.8.0/linux-64::cuda-memcheck-11.8.86-0 2025-05-07T19:45:59.6344890Z cuda-nsight nvidia/label/cuda-11.8.0/linux-64::cuda-nsight-11.8.86-0 2025-05-07T19:45:59.6345483Z cuda-nsight-compu~ nvidia/label/cuda-11.8.0/linux-64::cuda-nsight-compute-11.8.0-0 2025-05-07T19:45:59.6346083Z cuda-nvcc nvidia/label/cuda-11.8.0/linux-64::cuda-nvcc-11.8.89-0 2025-05-07T19:45:59.6346639Z cuda-nvdisasm nvidia/label/cuda-11.8.0/linux-64::cuda-nvdisasm-11.8.86-0 2025-05-07T19:45:59.6347198Z cuda-nvml-dev nvidia/label/cuda-11.8.0/linux-64::cuda-nvml-dev-11.8.86-0 2025-05-07T19:45:59.6347772Z cuda-nvprof nvidia/label/cuda-11.8.0/linux-64::cuda-nvprof-11.8.87-0 2025-05-07T19:45:59.6348318Z cuda-nvprune nvidia/label/cuda-11.8.0/linux-64::cuda-nvprune-11.8.86-0 2025-05-07T19:45:59.6348879Z cuda-nvrtc nvidia/label/cuda-11.8.0/linux-64::cuda-nvrtc-11.8.89-0 2025-05-07T19:45:59.6350992Z cuda-nvrtc-dev nvidia/label/cuda-11.8.0/linux-64::cuda-nvrtc-dev-11.8.89-0 2025-05-07T19:45:59.6351678Z cuda-nvtx nvidia/label/cuda-11.8.0/linux-64::cuda-nvtx-11.8.86-0 2025-05-07T19:45:59.6352210Z cuda-nvvp nvidia/label/cuda-11.8.0/linux-64::cuda-nvvp-11.8.87-0 2025-05-07T19:45:59.6353092Z cuda-profiler-api nvidia/label/cuda-11.8.0/linux-64::cuda-profiler-api-11.8.86-0 2025-05-07T19:45:59.6353821Z cuda-runtime nvidia/label/cuda-11.8.0/linux-64::cuda-runtime-11.8.0-0 2025-05-07T19:45:59.6354491Z cuda-sanitizer-api nvidia/label/cuda-11.8.0/linux-64::cuda-sanitizer-api-11.8.86-0 2025-05-07T19:45:59.6355138Z cuda-toolkit nvidia/label/cuda-11.8.0/linux-64::cuda-toolkit-11.8.0-0 2025-05-07T19:45:59.6355733Z cuda-tools nvidia/label/cuda-11.8.0/linux-64::cuda-tools-11.8.0-0 2025-05-07T19:45:59.6356367Z cuda-visual-tools nvidia/label/cuda-11.8.0/linux-64::cuda-visual-tools-11.8.0-0 2025-05-07T19:45:59.6356983Z gds-tools nvidia/label/cuda-11.8.0/linux-64::gds-tools-1.4.0.31-0 2025-05-07T19:45:59.6357558Z libcublas nvidia/label/cuda-11.8.0/linux-64::libcublas-11.11.3.6-0 2025-05-07T19:45:59.6358138Z libcublas-dev nvidia/label/cuda-11.8.0/linux-64::libcublas-dev-11.11.3.6-0 2025-05-07T19:45:59.6358747Z libcufft nvidia/label/cuda-11.8.0/linux-64::libcufft-10.9.0.58-0 2025-05-07T19:45:59.6359341Z libcufft-dev nvidia/label/cuda-11.8.0/linux-64::libcufft-dev-10.9.0.58-0 2025-05-07T19:45:59.6359917Z libcufile nvidia/label/cuda-11.8.0/linux-64::libcufile-1.4.0.31-0 2025-05-07T19:45:59.6360521Z libcufile-dev nvidia/label/cuda-11.8.0/linux-64::libcufile-dev-1.4.0.31-0 2025-05-07T19:45:59.6361109Z libcurand nvidia/label/cuda-11.8.0/linux-64::libcurand-10.3.0.86-0 2025-05-07T19:45:59.6361724Z libcurand-dev nvidia/label/cuda-11.8.0/linux-64::libcurand-dev-10.3.0.86-0 2025-05-07T19:45:59.6362361Z libcusolver nvidia/label/cuda-11.8.0/linux-64::libcusolver-11.4.1.48-0 2025-05-07T19:45:59.6362991Z libcusolver-dev nvidia/label/cuda-11.8.0/linux-64::libcusolver-dev-11.4.1.48-0 2025-05-07T19:45:59.6363637Z libcusparse nvidia/label/cuda-11.8.0/linux-64::libcusparse-11.7.5.86-0 2025-05-07T19:45:59.6364264Z libcusparse-dev nvidia/label/cuda-11.8.0/linux-64::libcusparse-dev-11.7.5.86-0 2025-05-07T19:45:59.6364880Z libnpp nvidia/label/cuda-11.8.0/linux-64::libnpp-11.8.0.86-0 2025-05-07T19:45:59.6365561Z libnpp-dev nvidia/label/cuda-11.8.0/linux-64::libnpp-dev-11.8.0.86-0 2025-05-07T19:45:59.6366094Z libnvjpeg nvidia/label/cuda-11.8.0/linux-64::libnvjpeg-11.9.0.86-0 2025-05-07T19:45:59.6366679Z libnvjpeg-dev nvidia/label/cuda-11.8.0/linux-64::libnvjpeg-dev-11.9.0.86-0 2025-05-07T19:45:59.6367282Z nsight-compute nvidia/label/cuda-11.8.0/linux-64::nsight-compute-2022.3.0.22-0 2025-05-07T19:45:59.6367685Z 2025-05-07T19:45:59.6380016Z 2025-05-07T19:45:59.6380561Z 2025-05-07T19:45:59.6381248Z Downloading and Extracting Packages: ...working... 2025-05-07T19:45:59.6383726Z nsight-compute-2022. | 610.0 MB | | 0% 2025-05-07T19:45:59.6384022Z 2025-05-07T19:45:59.6399329Z libcublas-dev-11.11. | 394.1 MB | | 0%  2025-05-07T19:45:59.6400155Z 2025-05-07T19:45:59.6400200Z 2025-05-07T19:45:59.6405926Z libcublas-11.11.3.6 | 364.0 MB | | 0%  2025-05-07T19:45:59.6406218Z 2025-05-07T19:45:59.6406224Z 2025-05-07T19:45:59.6408171Z 2025-05-07T19:45:59.6419223Z libcusparse-dev-11.7 | 359.7 MB | | 0%  2025-05-07T19:45:59.6420079Z 2025-05-07T19:45:59.6420093Z 2025-05-07T19:45:59.6420106Z 2025-05-07T19:45:59.6420119Z 2025-05-07T19:45:59.6454694Z libcufft-dev-10.9.0. | 275.8 MB | | 0%  2025-05-07T19:45:59.6455683Z 2025-05-07T19:45:59.6455698Z 2025-05-07T19:45:59.6455711Z 2025-05-07T19:45:59.6455722Z 2025-05-07T19:45:59.6455734Z 2025-05-07T19:45:59.6467685Z libcusparse-11.7.5.8 | 176.3 MB | | 0%  2025-05-07T19:45:59.6468176Z 2025-05-07T19:45:59.6468180Z 2025-05-07T19:45:59.6468201Z 2025-05-07T19:45:59.6468204Z 2025-05-07T19:45:59.6468208Z 2025-05-07T19:45:59.6468211Z 2025-05-07T19:45:59.6468491Z libnpp-11.8.0.86 | 147.8 MB | | 0%  2025-05-07T19:45:59.6468820Z 2025-05-07T19:45:59.6468823Z 2025-05-07T19:45:59.6468827Z 2025-05-07T19:45:59.6468830Z 2025-05-07T19:45:59.6468834Z 2025-05-07T19:45:59.6468837Z 2025-05-07T19:45:59.6468841Z 2025-05-07T19:45:59.6469115Z libnpp-dev-11.8.0.86 | 144.5 MB | | 0%  2025-05-07T19:45:59.6469433Z 2025-05-07T19:45:59.6469437Z 2025-05-07T19:45:59.6469468Z 2025-05-07T19:45:59.6469471Z 2025-05-07T19:45:59.6469474Z 2025-05-07T19:45:59.6469478Z 2025-05-07T19:45:59.6469481Z 2025-05-07T19:45:59.6469485Z 2025-05-07T19:45:59.6469767Z libcufft-10.9.0.58 | 142.8 MB | | 0%  2025-05-07T19:45:59.6470081Z 2025-05-07T19:45:59.6470085Z 2025-05-07T19:45:59.6470123Z 2025-05-07T19:45:59.6470132Z 2025-05-07T19:45:59.6470135Z 2025-05-07T19:45:59.6470139Z 2025-05-07T19:45:59.6470142Z 2025-05-07T19:45:59.6470145Z 2025-05-07T19:45:59.6470149Z 2025-05-07T19:45:59.6470437Z cuda-nvvp-11.8.87 | 114.4 MB | | 0%  2025-05-07T19:45:59.6470741Z 2025-05-07T19:45:59.6470745Z 2025-05-07T19:45:59.6470775Z 2025-05-07T19:45:59.6470778Z 2025-05-07T19:45:59.6470782Z 2025-05-07T19:45:59.6470785Z 2025-05-07T19:45:59.6470788Z 2025-05-07T19:45:59.6470792Z 2025-05-07T19:45:59.6470795Z 2025-05-07T19:45:59.6470798Z 2025-05-07T19:45:59.6471275Z cuda-nsight-11.8.86 | 113.6 MB | | 0%  2025-05-07T19:45:59.6471603Z 2025-05-07T19:45:59.6471652Z 2025-05-07T19:45:59.6471655Z 2025-05-07T19:45:59.6471658Z 2025-05-07T19:45:59.6471662Z 2025-05-07T19:45:59.6471665Z 2025-05-07T19:45:59.6471669Z 2025-05-07T19:45:59.6471672Z 2025-05-07T19:45:59.6471675Z 2025-05-07T19:45:59.6471679Z 2025-05-07T19:45:59.6471686Z 2025-05-07T19:45:59.6472358Z libcusolver-11.4.1.4 | 96.5 MB | | 0%  2025-05-07T19:45:59.6472853Z 2025-05-07T19:45:59.6472857Z 2025-05-07T19:45:59.6472860Z 2025-05-07T19:45:59.6472864Z 2025-05-07T19:45:59.6472867Z 2025-05-07T19:45:59.6472870Z 2025-05-07T19:45:59.6472974Z 2025-05-07T19:45:59.6473332Z 2025-05-07T19:45:59.6473342Z 2025-05-07T19:45:59.6473348Z 2025-05-07T19:45:59.6473353Z 2025-05-07T19:45:59.6473358Z 2025-05-07T19:45:59.6474089Z libcusolver-dev-11.4 | 66.3 MB | | 0%  2025-05-07T19:45:59.6474459Z 2025-05-07T19:45:59.6474463Z 2025-05-07T19:45:59.6474468Z 2025-05-07T19:45:59.6474500Z 2025-05-07T19:45:59.6474504Z 2025-05-07T19:45:59.6474508Z 2025-05-07T19:45:59.6474513Z 2025-05-07T19:45:59.6474517Z 2025-05-07T19:45:59.6474522Z 2025-05-07T19:45:59.6474527Z 2025-05-07T19:45:59.6474531Z 2025-05-07T19:45:59.6474536Z 2025-05-07T19:45:59.6474558Z 2025-05-07T19:45:59.6474897Z libcurand-dev-10.3.0 | 53.7 MB | | 0%  2025-05-07T19:45:59.6475278Z 2025-05-07T19:45:59.6475281Z 2025-05-07T19:45:59.6475285Z 2025-05-07T19:45:59.6475289Z 2025-05-07T19:45:59.6475292Z 2025-05-07T19:45:59.6475296Z 2025-05-07T19:45:59.6475299Z 2025-05-07T19:45:59.6475303Z 2025-05-07T19:45:59.6475306Z 2025-05-07T19:45:59.6475309Z 2025-05-07T19:45:59.6475313Z 2025-05-07T19:45:59.6475316Z 2025-05-07T19:45:59.6475320Z 2025-05-07T19:45:59.6475333Z 2025-05-07T19:45:59.6475857Z libcurand-10.3.0.86 | 53.2 MB | | 0%  2025-05-07T19:45:59.6476195Z 2025-05-07T19:45:59.6476199Z 2025-05-07T19:45:59.6476215Z 2025-05-07T19:45:59.6476219Z 2025-05-07T19:45:59.6476222Z 2025-05-07T19:45:59.6476226Z 2025-05-07T19:45:59.6476229Z 2025-05-07T19:45:59.6476233Z 2025-05-07T19:45:59.6476237Z 2025-05-07T19:45:59.6476240Z 2025-05-07T19:45:59.6476243Z 2025-05-07T19:45:59.6476247Z 2025-05-07T19:45:59.6477585Z 2025-05-07T19:45:59.6477684Z 2025-05-07T19:45:59.6477720Z 2025-05-07T19:45:59.6478110Z cuda-nvcc-11.8.89 | 50.8 MB | | 0%  2025-05-07T19:45:59.6478443Z 2025-05-07T19:45:59.6478446Z 2025-05-07T19:45:59.6478450Z 2025-05-07T19:45:59.6478454Z 2025-05-07T19:45:59.6478457Z 2025-05-07T19:45:59.6478460Z 2025-05-07T19:45:59.6478487Z 2025-05-07T19:45:59.6478490Z 2025-05-07T19:45:59.6478494Z 2025-05-07T19:45:59.6478497Z 2025-05-07T19:45:59.6478501Z 2025-05-07T19:45:59.6478504Z 2025-05-07T19:45:59.6478507Z 2025-05-07T19:45:59.6478511Z 2025-05-07T19:45:59.6478514Z 2025-05-07T19:45:59.6478518Z 2025-05-07T19:45:59.6478839Z cuda-nvdisasm-11.8.8 | 48.7 MB | | 0%  2025-05-07T19:45:59.6479214Z 2025-05-07T19:45:59.6479218Z 2025-05-07T19:45:59.6479221Z 2025-05-07T19:45:59.6479225Z 2025-05-07T19:45:59.6479228Z 2025-05-07T19:45:59.6479232Z 2025-05-07T19:45:59.6479235Z 2025-05-07T19:45:59.6479248Z 2025-05-07T19:45:59.6479252Z 2025-05-07T19:45:59.6479255Z 2025-05-07T19:45:59.6479272Z 2025-05-07T19:45:59.6479276Z 2025-05-07T19:45:59.6479279Z 2025-05-07T19:45:59.6479282Z 2025-05-07T19:45:59.6479286Z 2025-05-07T19:45:59.6479289Z 2025-05-07T19:45:59.6479292Z 2025-05-07T19:45:59.6479652Z cuda-cupti-11.8.87 | 25.3 MB | | 0%  2025-05-07T19:45:59.6479992Z 2025-05-07T19:45:59.6479996Z 2025-05-07T19:45:59.6479999Z 2025-05-07T19:45:59.6480003Z 2025-05-07T19:45:59.6480006Z 2025-05-07T19:45:59.6480009Z 2025-05-07T19:45:59.6480013Z 2025-05-07T19:45:59.6480016Z 2025-05-07T19:45:59.6480020Z 2025-05-07T19:45:59.6480023Z 2025-05-07T19:45:59.6480027Z 2025-05-07T19:45:59.6480030Z 2025-05-07T19:45:59.6480034Z 2025-05-07T19:45:59.6480037Z 2025-05-07T19:45:59.6480065Z 2025-05-07T19:45:59.6480069Z 2025-05-07T19:45:59.6480072Z 2025-05-07T19:45:59.6480076Z 2025-05-07T19:45:59.6480408Z cuda-nvrtc-11.8.89 | 19.1 MB | | 0%  2025-05-07T19:45:59.6480752Z 2025-05-07T19:45:59.6480756Z 2025-05-07T19:45:59.6480760Z 2025-05-07T19:45:59.6480790Z 2025-05-07T19:45:59.6480793Z 2025-05-07T19:45:59.6480797Z 2025-05-07T19:45:59.6480800Z 2025-05-07T19:45:59.6480803Z 2025-05-07T19:45:59.6480807Z 2025-05-07T19:45:59.6480811Z 2025-05-07T19:45:59.6480814Z 2025-05-07T19:45:59.6480817Z 2025-05-07T19:45:59.6480821Z 2025-05-07T19:45:59.6480824Z 2025-05-07T19:45:59.6480828Z 2025-05-07T19:45:59.6480831Z 2025-05-07T19:45:59.6480841Z 2025-05-07T19:45:59.6480845Z 2025-05-07T19:45:59.6480848Z 2025-05-07T19:46:04.4902125Z ... (more hidden) ... 2025-05-07T19:46:04.4902499Z 2025-05-07T19:46:04.4902505Z 2025-05-07T19:46:04.4902508Z 2025-05-07T19:46:04.4902512Z 2025-05-07T19:46:04.4903933Z libcufft-dev-10.9.0. | 275.8 MB | ########## | 100%  2025-05-07T19:46:04.4904249Z 2025-05-07T19:46:04.4904300Z 2025-05-07T19:46:04.4904323Z 2025-05-07T19:46:04.4904327Z 2025-05-07T19:46:06.2007152Z libcufft-dev-10.9.0. | 275.8 MB | ########## | 100%  2025-05-07T19:46:06.2007534Z 2025-05-07T19:46:06.2007801Z libcublas-dev-11.11. | 394.1 MB | ########## | 100%  2025-05-07T19:46:06.2008101Z 2025-05-07T19:46:08.1655435Z libcublas-dev-11.11. | 394.1 MB | ########## | 100%  2025-05-07T19:46:08.1655775Z 2025-05-07T19:46:08.1655781Z 2025-05-07T19:46:08.1656061Z libcublas-11.11.3.6 | 364.0 MB | ########## | 100%  2025-05-07T19:46:08.1656347Z 2025-05-07T19:46:08.1656353Z 2025-05-07T19:46:08.4476178Z libcublas-11.11.3.6 | 364.0 MB | ########## | 100%  2025-05-07T19:46:08.4476546Z 2025-05-07T19:46:08.4476551Z 2025-05-07T19:46:08.4476557Z 2025-05-07T19:46:08.4476565Z 2025-05-07T19:46:08.4476569Z 2025-05-07T19:46:08.4476860Z libcusparse-11.7.5.8 | 176.3 MB | ########## | 100%  2025-05-07T19:46:08.4477176Z 2025-05-07T19:46:08.4477182Z 2025-05-07T19:46:08.4477485Z 2025-05-07T19:46:08.4477672Z 2025-05-07T19:46:08.4477676Z 2025-05-07T19:46:08.7688955Z libcusparse-11.7.5.8 | 176.3 MB | ########## | 100%  2025-05-07T19:46:08.7689297Z 2025-05-07T19:46:08.7689303Z 2025-05-07T19:46:08.7689327Z 2025-05-07T19:46:08.7689616Z libcusparse-dev-11.7 | 359.7 MB | ########## | 100%  2025-05-07T19:46:08.7689915Z 2025-05-07T19:46:08.7689920Z 2025-05-07T19:46:08.7689924Z 2025-05-07T19:46:08.8107433Z libcusparse-dev-11.7 | 359.7 MB | ########## | 100%  2025-05-07T19:46:08.8107796Z 2025-05-07T19:46:08.8107802Z 2025-05-07T19:46:08.8107806Z 2025-05-07T19:46:08.8107810Z 2025-05-07T19:46:08.8107813Z 2025-05-07T19:46:08.8107818Z 2025-05-07T19:46:08.8108087Z libnpp-11.8.0.86 | 147.8 MB | ########## | 100%  2025-05-07T19:46:08.8108416Z 2025-05-07T19:46:08.8108422Z 2025-05-07T19:46:08.8108428Z 2025-05-07T19:46:08.8108434Z 2025-05-07T19:46:08.8108438Z 2025-05-07T19:46:08.8108443Z 2025-05-07T19:46:10.7230250Z libnpp-11.8.0.86 | 147.8 MB | ########## | 100%  2025-05-07T19:46:10.7230664Z 2025-05-07T19:46:10.7230707Z 2025-05-07T19:46:10.7230711Z 2025-05-07T19:46:10.7230714Z 2025-05-07T19:46:10.7230718Z 2025-05-07T19:46:10.7230721Z 2025-05-07T19:46:10.7230725Z 2025-05-07T19:46:10.7230728Z 2025-05-07T19:46:10.7230731Z 2025-05-07T19:46:10.7230735Z 2025-05-07T19:46:10.7231065Z cuda-nsight-11.8.86 | 113.6 MB | ########## | 100%  2025-05-07T19:46:10.7231606Z 2025-05-07T19:46:10.7231610Z 2025-05-07T19:46:10.7231614Z 2025-05-07T19:46:10.7231617Z 2025-05-07T19:46:10.7231621Z 2025-05-07T19:46:10.7231624Z 2025-05-07T19:46:10.7231627Z 2025-05-07T19:46:10.7231631Z 2025-05-07T19:46:10.7231634Z 2025-05-07T19:46:10.7231637Z 2025-05-07T19:46:10.8321729Z cuda-nsight-11.8.86 | 113.6 MB | ########## | 100%  2025-05-07T19:46:10.8322281Z nsight-compute-2022. | 610.0 MB | ########## | 100% 2025-05-07T19:46:11.4250604Z nsight-compute-2022. | 610.0 MB | ########## | 100% 2025-05-07T19:46:11.4251015Z 2025-05-07T19:46:11.4251020Z 2025-05-07T19:46:11.4251024Z 2025-05-07T19:46:11.4251027Z 2025-05-07T19:46:11.4251058Z 2025-05-07T19:46:11.4251061Z 2025-05-07T19:46:11.4251065Z 2025-05-07T19:46:11.4251068Z 2025-05-07T19:46:11.4251072Z 2025-05-07T19:46:11.4251635Z cuda-nvvp-11.8.87 | 114.4 MB | ########## | 100%  2025-05-07T19:46:11.4251962Z 2025-05-07T19:46:11.4251965Z 2025-05-07T19:46:11.4251969Z 2025-05-07T19:46:11.4251972Z 2025-05-07T19:46:11.4251976Z 2025-05-07T19:46:11.4252007Z 2025-05-07T19:46:11.4252010Z 2025-05-07T19:46:11.4252014Z 2025-05-07T19:46:11.4252017Z 2025-05-07T19:46:12.0247700Z cuda-nvvp-11.8.87 | 114.4 MB | ########## | 100%  2025-05-07T19:46:12.0248083Z 2025-05-07T19:46:12.0248089Z 2025-05-07T19:46:12.0248093Z 2025-05-07T19:46:12.0248097Z 2025-05-07T19:46:12.0248105Z 2025-05-07T19:46:12.0248109Z 2025-05-07T19:46:12.0248114Z 2025-05-07T19:46:12.0248164Z 2025-05-07T19:46:12.0248210Z 2025-05-07T19:46:12.0248214Z 2025-05-07T19:46:12.0248218Z 2025-05-07T19:46:12.0248221Z 2025-05-07T19:46:12.0248569Z libcusolver-dev-11.4 | 66.3 MB | ########## | 100%  2025-05-07T19:46:12.0248910Z 2025-05-07T19:46:12.0248914Z 2025-05-07T19:46:12.0248917Z 2025-05-07T19:46:12.0248921Z 2025-05-07T19:46:12.0248924Z 2025-05-07T19:46:12.0248928Z 2025-05-07T19:46:12.0248952Z 2025-05-07T19:46:12.0248955Z 2025-05-07T19:46:12.0248958Z 2025-05-07T19:46:12.0248962Z 2025-05-07T19:46:12.0248965Z 2025-05-07T19:46:12.0248969Z 2025-05-07T19:46:12.0274687Z libcusolver-dev-11.4 | 66.3 MB | ########## | 100%  2025-05-07T19:46:12.0275055Z 2025-05-07T19:46:12.0275060Z 2025-05-07T19:46:12.0275087Z 2025-05-07T19:46:12.0275091Z 2025-05-07T19:46:12.0275095Z 2025-05-07T19:46:12.0275098Z 2025-05-07T19:46:12.0275102Z 2025-05-07T19:46:12.0275105Z 2025-05-07T19:46:12.0275638Z libcufft-10.9.0.58 | 142.8 MB | ########## | 100%  2025-05-07T19:46:12.0276091Z 2025-05-07T19:46:12.0276095Z 2025-05-07T19:46:12.0276098Z 2025-05-07T19:46:12.0276124Z 2025-05-07T19:46:12.0276128Z 2025-05-07T19:46:12.0276131Z 2025-05-07T19:46:12.0276135Z 2025-05-07T19:46:12.0276138Z 2025-05-07T19:46:12.3396975Z libcufft-10.9.0.58 | 142.8 MB | ########## | 100%  2025-05-07T19:46:12.3397370Z 2025-05-07T19:46:12.3397405Z 2025-05-07T19:46:12.3397410Z 2025-05-07T19:46:12.3397414Z 2025-05-07T19:46:12.3397418Z 2025-05-07T19:46:12.3397424Z 2025-05-07T19:46:12.3397428Z 2025-05-07T19:46:12.3397432Z 2025-05-07T19:46:12.3397437Z 2025-05-07T19:46:12.3397442Z 2025-05-07T19:46:12.3397448Z 2025-05-07T19:46:12.3397751Z libcusolver-11.4.1.4 | 96.5 MB | ########## | 100%  2025-05-07T19:46:12.3398097Z 2025-05-07T19:46:12.3398102Z 2025-05-07T19:46:12.3398107Z 2025-05-07T19:46:12.3398111Z 2025-05-07T19:46:12.3398115Z 2025-05-07T19:46:12.3398153Z 2025-05-07T19:46:12.3398174Z 2025-05-07T19:46:12.3398178Z 2025-05-07T19:46:12.3398181Z 2025-05-07T19:46:12.3398185Z 2025-05-07T19:46:12.3398188Z 2025-05-07T19:46:12.4196184Z libcusolver-11.4.1.4 | 96.5 MB | ########## | 100%  2025-05-07T19:46:12.4196606Z 2025-05-07T19:46:12.4196611Z 2025-05-07T19:46:12.4196614Z 2025-05-07T19:46:12.4196618Z 2025-05-07T19:46:12.4196622Z 2025-05-07T19:46:12.4196626Z 2025-05-07T19:46:12.4196631Z 2025-05-07T19:46:12.4196635Z 2025-05-07T19:46:12.4196641Z 2025-05-07T19:46:12.4196645Z 2025-05-07T19:46:12.4196649Z 2025-05-07T19:46:12.4196652Z 2025-05-07T19:46:12.4196656Z 2025-05-07T19:46:12.4196995Z libcurand-dev-10.3.0 | 53.7 MB | ########## | 100%  2025-05-07T19:46:12.4197331Z 2025-05-07T19:46:12.4197334Z 2025-05-07T19:46:12.4197338Z 2025-05-07T19:46:12.4197342Z 2025-05-07T19:46:12.4197346Z 2025-05-07T19:46:12.4197350Z 2025-05-07T19:46:12.4197353Z 2025-05-07T19:46:12.4197379Z 2025-05-07T19:46:12.4197395Z 2025-05-07T19:46:12.4197399Z 2025-05-07T19:46:12.4197402Z 2025-05-07T19:46:12.4197426Z 2025-05-07T19:46:12.4197429Z 2025-05-07T19:46:12.9224033Z libcurand-dev-10.3.0 | 53.7 MB | ########## | 100%  2025-05-07T19:46:12.9224466Z 2025-05-07T19:46:12.9224472Z 2025-05-07T19:46:12.9224475Z 2025-05-07T19:46:12.9224480Z 2025-05-07T19:46:12.9224483Z 2025-05-07T19:46:12.9224487Z 2025-05-07T19:46:12.9224491Z 2025-05-07T19:46:12.9224495Z 2025-05-07T19:46:12.9224499Z 2025-05-07T19:46:12.9224503Z 2025-05-07T19:46:12.9224507Z 2025-05-07T19:46:12.9224511Z 2025-05-07T19:46:12.9224514Z 2025-05-07T19:46:12.9224519Z 2025-05-07T19:46:12.9224864Z libcurand-10.3.0.86 | 53.2 MB | ########## | 100%  2025-05-07T19:46:12.9225223Z 2025-05-07T19:46:12.9225253Z 2025-05-07T19:46:12.9225257Z 2025-05-07T19:46:12.9225261Z 2025-05-07T19:46:12.9225265Z 2025-05-07T19:46:12.9225269Z 2025-05-07T19:46:12.9225315Z 2025-05-07T19:46:12.9225338Z 2025-05-07T19:46:12.9225342Z 2025-05-07T19:46:12.9225345Z 2025-05-07T19:46:12.9225349Z 2025-05-07T19:46:12.9225352Z 2025-05-07T19:46:12.9225356Z 2025-05-07T19:46:12.9225359Z 2025-05-07T19:46:13.0434065Z libcurand-10.3.0.86 | 53.2 MB | ########## | 100%  2025-05-07T19:46:13.0434453Z 2025-05-07T19:46:13.0434458Z 2025-05-07T19:46:13.0434461Z 2025-05-07T19:46:13.0434465Z 2025-05-07T19:46:13.0434468Z 2025-05-07T19:46:13.0434472Z 2025-05-07T19:46:13.0434476Z 2025-05-07T19:46:13.0434501Z 2025-05-07T19:46:13.0434506Z 2025-05-07T19:46:13.0434511Z 2025-05-07T19:46:13.0434514Z 2025-05-07T19:46:13.0434517Z 2025-05-07T19:46:13.0434521Z 2025-05-07T19:46:13.0434525Z 2025-05-07T19:46:13.0434529Z 2025-05-07T19:46:13.0434532Z 2025-05-07T19:46:13.0434868Z cuda-nvdisasm-11.8.8 | 48.7 MB | ########## | 100%  2025-05-07T19:46:13.0435218Z 2025-05-07T19:46:13.0435244Z 2025-05-07T19:46:13.0435505Z 2025-05-07T19:46:13.0435645Z 2025-05-07T19:46:13.0435649Z 2025-05-07T19:46:13.0435653Z 2025-05-07T19:46:13.0435656Z 2025-05-07T19:46:13.0435659Z 2025-05-07T19:46:13.0435663Z 2025-05-07T19:46:13.0435666Z 2025-05-07T19:46:13.0435669Z 2025-05-07T19:46:13.0435673Z 2025-05-07T19:46:13.0435676Z 2025-05-07T19:46:13.0435679Z 2025-05-07T19:46:13.0435683Z 2025-05-07T19:46:13.0435687Z 2025-05-07T19:46:13.0686511Z cuda-nvdisasm-11.8.8 | 48.7 MB | ########## | 100%  2025-05-07T19:46:13.0686921Z 2025-05-07T19:46:13.0686926Z 2025-05-07T19:46:13.0686929Z 2025-05-07T19:46:13.0686933Z 2025-05-07T19:46:13.0686936Z 2025-05-07T19:46:13.0686939Z 2025-05-07T19:46:13.0686943Z 2025-05-07T19:46:13.0686946Z 2025-05-07T19:46:13.0686950Z 2025-05-07T19:46:13.0686953Z 2025-05-07T19:46:13.0686957Z 2025-05-07T19:46:13.0686961Z 2025-05-07T19:46:13.0686964Z 2025-05-07T19:46:13.0686967Z 2025-05-07T19:46:13.0686971Z 2025-05-07T19:46:13.0687000Z 2025-05-07T19:46:13.0687147Z 2025-05-07T19:46:13.0687487Z cuda-cupti-11.8.87 | 25.3 MB | ########## | 100%  2025-05-07T19:46:13.0687827Z 2025-05-07T19:46:13.0687830Z 2025-05-07T19:46:13.0687833Z 2025-05-07T19:46:13.0687837Z 2025-05-07T19:46:13.0687840Z 2025-05-07T19:46:13.0687844Z 2025-05-07T19:46:13.0687876Z 2025-05-07T19:46:13.0687880Z 2025-05-07T19:46:13.0687883Z 2025-05-07T19:46:13.0687887Z 2025-05-07T19:46:13.0687890Z 2025-05-07T19:46:13.0687893Z 2025-05-07T19:46:13.0687897Z 2025-05-07T19:46:13.0687900Z 2025-05-07T19:46:13.0687904Z 2025-05-07T19:46:13.0687907Z 2025-05-07T19:46:13.0687911Z 2025-05-07T19:46:13.2636015Z cuda-cupti-11.8.87 | 25.3 MB | ########## | 100%  2025-05-07T19:46:13.2636395Z 2025-05-07T19:46:13.2636448Z 2025-05-07T19:46:13.2636454Z 2025-05-07T19:46:13.2636458Z 2025-05-07T19:46:13.2636462Z 2025-05-07T19:46:13.2636466Z 2025-05-07T19:46:13.2636472Z 2025-05-07T19:46:13.2636524Z 2025-05-07T19:46:13.2636544Z 2025-05-07T19:46:13.2636548Z 2025-05-07T19:46:13.2636551Z 2025-05-07T19:46:13.2636555Z 2025-05-07T19:46:13.2636558Z 2025-05-07T19:46:13.2636561Z 2025-05-07T19:46:13.2636565Z 2025-05-07T19:46:13.2636878Z cuda-nvcc-11.8.89 | 50.8 MB | ########## | 100%  2025-05-07T19:46:13.2637229Z 2025-05-07T19:46:13.2637232Z 2025-05-07T19:46:13.2637236Z 2025-05-07T19:46:13.2637239Z 2025-05-07T19:46:13.2637242Z 2025-05-07T19:46:13.2637246Z 2025-05-07T19:46:13.2637249Z 2025-05-07T19:46:13.2637253Z 2025-05-07T19:46:13.2637256Z 2025-05-07T19:46:13.2637259Z 2025-05-07T19:46:13.2637263Z 2025-05-07T19:46:13.2637266Z 2025-05-07T19:46:13.2637269Z 2025-05-07T19:46:13.2637273Z 2025-05-07T19:46:13.2637276Z 2025-05-07T19:46:13.3133583Z cuda-nvcc-11.8.89 | 50.8 MB | ########## | 100%  2025-05-07T19:46:13.3133955Z 2025-05-07T19:46:13.3133960Z 2025-05-07T19:46:13.3133991Z 2025-05-07T19:46:13.3134005Z 2025-05-07T19:46:13.3134009Z 2025-05-07T19:46:13.3134012Z 2025-05-07T19:46:13.3134016Z 2025-05-07T19:46:13.3134019Z 2025-05-07T19:46:13.3134023Z 2025-05-07T19:46:13.3134047Z 2025-05-07T19:46:13.3134051Z 2025-05-07T19:46:13.3134054Z 2025-05-07T19:46:13.3134058Z 2025-05-07T19:46:13.3134061Z 2025-05-07T19:46:13.3134065Z 2025-05-07T19:46:13.3134068Z 2025-05-07T19:46:13.3134072Z 2025-05-07T19:46:13.3134075Z 2025-05-07T19:46:13.3134406Z cuda-nvrtc-11.8.89 | 19.1 MB | ########## | 100%  2025-05-07T19:46:13.3134775Z 2025-05-07T19:46:13.3134779Z 2025-05-07T19:46:13.3134782Z 2025-05-07T19:46:13.3134786Z 2025-05-07T19:46:13.3134789Z 2025-05-07T19:46:13.3134792Z 2025-05-07T19:46:13.3134796Z 2025-05-07T19:46:13.3134799Z 2025-05-07T19:46:13.3134803Z 2025-05-07T19:46:13.3134806Z 2025-05-07T19:46:13.3134810Z 2025-05-07T19:46:13.3134813Z 2025-05-07T19:46:13.3134817Z 2025-05-07T19:46:13.3134820Z 2025-05-07T19:46:13.3135053Z 2025-05-07T19:46:13.3135198Z 2025-05-07T19:46:13.3135202Z 2025-05-07T19:46:13.3135206Z 2025-05-07T19:46:13.4657143Z cuda-nvrtc-11.8.89 | 19.1 MB | ########## | 100%  2025-05-07T19:46:13.4657560Z 2025-05-07T19:46:13.4657566Z 2025-05-07T19:46:13.4657572Z 2025-05-07T19:46:13.4657577Z 2025-05-07T19:46:13.4657582Z 2025-05-07T19:46:13.4657587Z 2025-05-07T19:46:13.4657592Z 2025-05-07T19:46:13.4657596Z 2025-05-07T19:46:13.4657600Z 2025-05-07T19:46:13.4657605Z 2025-05-07T19:46:13.4657610Z 2025-05-07T19:46:13.4657636Z 2025-05-07T19:46:13.4657639Z 2025-05-07T19:46:13.4657642Z 2025-05-07T19:46:13.4657646Z 2025-05-07T19:46:13.4657649Z 2025-05-07T19:46:13.4657653Z 2025-05-07T19:46:13.4657656Z 2025-05-07T19:46:13.4657660Z 2025-05-07T19:46:13.4657933Z ... (more hidden) ... 2025-05-07T19:46:13.4658237Z 2025-05-07T19:46:13.4658241Z 2025-05-07T19:46:13.4658270Z 2025-05-07T19:46:13.4658307Z 2025-05-07T19:46:13.4658325Z 2025-05-07T19:46:13.4658329Z 2025-05-07T19:46:13.4658332Z 2025-05-07T19:46:13.4658336Z 2025-05-07T19:46:13.4658339Z 2025-05-07T19:46:13.4658342Z 2025-05-07T19:46:13.4658346Z 2025-05-07T19:46:13.4658349Z 2025-05-07T19:46:13.4658353Z 2025-05-07T19:46:13.4658356Z 2025-05-07T19:46:13.4658359Z 2025-05-07T19:46:13.4658363Z 2025-05-07T19:46:13.4658366Z 2025-05-07T19:46:13.4658370Z 2025-05-07T19:46:13.4658373Z 2025-05-07T19:46:13.6373737Z ... (more hidden) ... 2025-05-07T19:46:13.6374109Z 2025-05-07T19:46:13.6374114Z 2025-05-07T19:46:13.6374117Z 2025-05-07T19:46:13.6374122Z 2025-05-07T19:46:13.6374152Z 2025-05-07T19:46:13.6374156Z 2025-05-07T19:46:13.6374160Z 2025-05-07T19:46:13.6374476Z libnpp-dev-11.8.0.86 | 144.5 MB | ########## | 100%  2025-05-07T19:46:13.6374791Z 2025-05-07T19:46:13.6374796Z 2025-05-07T19:46:13.6374802Z 2025-05-07T19:46:13.6374807Z 2025-05-07T19:46:13.6374856Z 2025-05-07T19:46:13.6374873Z 2025-05-07T19:46:13.6374877Z 2025-05-07T19:46:35.1177578Z libnpp-dev-11.8.0.86 | 144.5 MB | ########## | 100%  2025-05-07T19:46:35.1179709Z 2025-05-07T19:46:35.1181056Z 2025-05-07T19:46:35.1181067Z 2025-05-07T19:46:35.1181072Z 2025-05-07T19:46:44.1871459Z libcufft-dev-10.9.0. | 275.8 MB | ########## | 100%  2025-05-07T19:46:44.1871794Z 2025-05-07T19:46:44.1871802Z 2025-05-07T19:46:48.8472462Z libcublas-11.11.3.6 | 364.0 MB | ########## | 100%  2025-05-07T19:46:48.8472858Z 2025-05-07T19:46:52.9369560Z libcublas-dev-11.11. | 394.1 MB | ########## | 100%  2025-05-07T19:46:52.9369905Z 2025-05-07T19:46:52.9369911Z 2025-05-07T19:46:52.9369944Z 2025-05-07T19:46:52.9369967Z 2025-05-07T19:46:52.9369970Z 2025-05-07T19:47:02.5466649Z libcusparse-11.7.5.8 | 176.3 MB | ########## | 100%  2025-05-07T19:47:02.5467027Z 2025-05-07T19:47:02.5467058Z 2025-05-07T19:47:02.5467062Z 2025-05-07T19:47:02.5467115Z 2025-05-07T19:47:02.5467141Z 2025-05-07T19:47:02.5467145Z 2025-05-07T19:47:02.5468349Z 2025-05-07T19:47:02.5468382Z 2025-05-07T19:47:02.5468385Z 2025-05-07T19:47:02.5468389Z 2025-05-07T19:47:03.2792959Z cuda-nsight-11.8.86 | 113.6 MB | ########## | 100%  2025-05-07T19:47:03.2793327Z 2025-05-07T19:47:03.2793332Z 2025-05-07T19:47:03.2793336Z 2025-05-07T19:47:03.2793339Z 2025-05-07T19:47:03.2793344Z 2025-05-07T19:47:03.2793349Z 2025-05-07T19:47:11.7348443Z libnpp-11.8.0.86 | 147.8 MB | ########## | 100%  2025-05-07T19:47:11.7348862Z 2025-05-07T19:47:11.7348867Z 2025-05-07T19:47:11.7348871Z 2025-05-07T19:47:11.7348877Z 2025-05-07T19:47:11.7348881Z 2025-05-07T19:47:11.7348887Z 2025-05-07T19:47:11.7348891Z 2025-05-07T19:47:11.7348896Z 2025-05-07T19:47:11.7348900Z 2025-05-07T19:47:18.4950217Z cuda-nvvp-11.8.87 | 114.4 MB | ########## | 100%  2025-05-07T19:47:18.4950628Z 2025-05-07T19:47:18.4950635Z 2025-05-07T19:47:18.4950969Z 2025-05-07T19:47:21.4334448Z libcusparse-dev-11.7 | 359.7 MB | ########## | 100%  2025-05-07T19:47:21.4334818Z 2025-05-07T19:47:21.4334826Z 2025-05-07T19:47:21.4334831Z 2025-05-07T19:47:21.4334837Z 2025-05-07T19:47:21.4334865Z 2025-05-07T19:47:21.4334871Z 2025-05-07T19:47:21.4334876Z 2025-05-07T19:47:21.4334881Z 2025-05-07T19:47:21.4334889Z 2025-05-07T19:47:21.4334894Z 2025-05-07T19:47:21.4334899Z 2025-05-07T19:47:21.4334903Z 2025-05-07T19:47:33.1918528Z libcusolver-dev-11.4 | 66.3 MB | ########## | 100%  2025-05-07T19:47:33.1918948Z 2025-05-07T19:47:33.1918953Z 2025-05-07T19:47:33.1918958Z 2025-05-07T19:47:33.1918961Z 2025-05-07T19:47:33.1918965Z 2025-05-07T19:47:33.1918968Z 2025-05-07T19:47:33.1918972Z 2025-05-07T19:47:33.1918978Z 2025-05-07T19:47:35.1779885Z libcufft-10.9.0.58 | 142.8 MB | ########## | 100%  2025-05-07T19:47:35.1780273Z 2025-05-07T19:47:35.1780304Z 2025-05-07T19:47:35.1780348Z 2025-05-07T19:47:35.1780369Z 2025-05-07T19:47:35.1780373Z 2025-05-07T19:47:35.1780376Z 2025-05-07T19:47:35.1780380Z 2025-05-07T19:47:35.1780383Z 2025-05-07T19:47:35.1780387Z 2025-05-07T19:47:35.1780390Z 2025-05-07T19:47:35.1780394Z 2025-05-07T19:47:37.7520360Z libcusolver-11.4.1.4 | 96.5 MB | ########## | 100%  2025-05-07T19:47:37.7520740Z 2025-05-07T19:47:37.7520746Z 2025-05-07T19:47:37.7520751Z 2025-05-07T19:47:37.7520756Z 2025-05-07T19:47:37.7520787Z 2025-05-07T19:47:37.7520791Z 2025-05-07T19:47:37.7520796Z 2025-05-07T19:47:37.7520801Z 2025-05-07T19:47:37.7520807Z 2025-05-07T19:47:37.7520812Z 2025-05-07T19:47:37.7520817Z 2025-05-07T19:47:37.7520821Z 2025-05-07T19:47:37.7520824Z 2025-05-07T19:47:39.7650375Z libcurand-dev-10.3.0 | 53.7 MB | ########## | 100%  2025-05-07T19:47:39.7650784Z 2025-05-07T19:47:39.7650789Z 2025-05-07T19:47:39.7650794Z 2025-05-07T19:47:39.7650798Z 2025-05-07T19:47:39.7650843Z 2025-05-07T19:47:39.7650864Z 2025-05-07T19:47:39.7650867Z 2025-05-07T19:47:39.7650871Z 2025-05-07T19:47:39.7650874Z 2025-05-07T19:47:39.7650877Z 2025-05-07T19:47:39.7650881Z 2025-05-07T19:47:39.7650884Z 2025-05-07T19:47:39.7650918Z 2025-05-07T19:47:39.7650921Z 2025-05-07T19:47:41.0403026Z libcurand-10.3.0.86 | 53.2 MB | ########## | 100%  2025-05-07T19:47:41.0403418Z 2025-05-07T19:47:41.0403424Z 2025-05-07T19:47:41.0403429Z 2025-05-07T19:47:41.0403435Z 2025-05-07T19:47:41.0403440Z 2025-05-07T19:47:41.0403471Z 2025-05-07T19:47:41.0403476Z 2025-05-07T19:47:41.0403481Z 2025-05-07T19:47:41.0403486Z 2025-05-07T19:47:41.0403491Z 2025-05-07T19:47:41.0403494Z 2025-05-07T19:47:41.0403497Z 2025-05-07T19:47:41.0403502Z 2025-05-07T19:47:41.0403506Z 2025-05-07T19:47:41.0403509Z 2025-05-07T19:47:41.0403513Z 2025-05-07T19:47:42.7267157Z cuda-nvdisasm-11.8.8 | 48.7 MB | ########## | 100%  2025-05-07T19:47:42.7267650Z 2025-05-07T19:47:42.7267677Z 2025-05-07T19:47:42.7267681Z 2025-05-07T19:47:42.7267684Z 2025-05-07T19:47:42.7267688Z 2025-05-07T19:47:42.7267691Z 2025-05-07T19:47:42.7267695Z 2025-05-07T19:47:42.7267698Z 2025-05-07T19:47:42.7267702Z 2025-05-07T19:47:42.7267705Z 2025-05-07T19:47:42.7267708Z 2025-05-07T19:47:42.7267712Z 2025-05-07T19:47:42.7267715Z 2025-05-07T19:47:42.7267718Z 2025-05-07T19:47:42.7267722Z 2025-05-07T19:47:42.7267725Z 2025-05-07T19:47:42.7267729Z 2025-05-07T19:47:45.0695563Z cuda-cupti-11.8.87 | 25.3 MB | ########## | 100%  2025-05-07T19:47:45.0695995Z 2025-05-07T19:47:45.0696001Z 2025-05-07T19:47:45.0696006Z 2025-05-07T19:47:45.0696014Z 2025-05-07T19:47:45.0696019Z 2025-05-07T19:47:45.0696024Z 2025-05-07T19:47:45.0696028Z 2025-05-07T19:47:45.0696032Z 2025-05-07T19:47:45.0696038Z 2025-05-07T19:47:45.0696042Z 2025-05-07T19:47:45.0696046Z 2025-05-07T19:47:45.0696050Z 2025-05-07T19:47:45.0696055Z 2025-05-07T19:47:45.0696305Z 2025-05-07T19:47:45.0696437Z 2025-05-07T19:47:45.0696440Z 2025-05-07T19:47:45.0696444Z 2025-05-07T19:47:45.0696447Z 2025-05-07T19:47:46.7859166Z cuda-nvrtc-11.8.89 | 19.1 MB | ########## | 100%  2025-05-07T19:47:46.7859577Z 2025-05-07T19:47:46.7859582Z 2025-05-07T19:47:46.7859587Z 2025-05-07T19:47:46.7859592Z 2025-05-07T19:47:46.7859599Z 2025-05-07T19:47:46.7859602Z 2025-05-07T19:47:46.7859606Z 2025-05-07T19:47:46.7859610Z 2025-05-07T19:47:46.7859615Z 2025-05-07T19:47:46.7859642Z 2025-05-07T19:47:46.7859645Z 2025-05-07T19:47:46.7859649Z 2025-05-07T19:47:46.7859652Z 2025-05-07T19:47:46.7859655Z 2025-05-07T19:47:46.7859659Z 2025-05-07T19:47:49.0336252Z cuda-nvcc-11.8.89 | 50.8 MB | ########## | 100%  2025-05-07T19:47:49.0336684Z 2025-05-07T19:47:49.0336690Z 2025-05-07T19:47:49.0336697Z 2025-05-07T19:47:49.0336702Z 2025-05-07T19:47:49.0336709Z 2025-05-07T19:47:49.0336753Z 2025-05-07T19:47:49.0336783Z 2025-05-07T19:47:49.0336786Z 2025-05-07T19:47:49.0336789Z 2025-05-07T19:47:49.0336793Z 2025-05-07T19:47:49.0336796Z 2025-05-07T19:47:49.0336800Z 2025-05-07T19:47:49.0336803Z 2025-05-07T19:47:49.0336807Z 2025-05-07T19:47:49.0336810Z 2025-05-07T19:47:49.0336814Z 2025-05-07T19:47:49.0336817Z 2025-05-07T19:47:49.0336820Z 2025-05-07T19:47:49.0336823Z 2025-05-07T19:48:00.4711599Z ... (more hidden) ... 2025-05-07T19:48:01.9710931Z nsight-compute-2022. | 610.0 MB | ########## | 100% 2025-05-07T19:48:01.9711402Z 2025-05-07T19:48:01.9711537Z 2025-05-07T19:48:01.9711544Z 2025-05-07T19:48:01.9711576Z 2025-05-07T19:48:01.9711585Z 2025-05-07T19:48:01.9711620Z 2025-05-07T19:48:01.9711648Z 2025-05-07T19:48:01.9717091Z libnpp-dev-11.8.0.86 | 144.5 MB | ########## | 100%  2025-05-07T19:48:01.9717443Z 2025-05-07T19:48:01.9717930Z 2025-05-07T19:48:01.9718080Z 2025-05-07T19:48:01.9718179Z 2025-05-07T19:48:01.9718210Z 2025-05-07T19:48:01.9718260Z 2025-05-07T19:48:01.9718268Z 2025-05-07T19:48:01.9718340Z 2025-05-07T19:48:01.9718385Z 2025-05-07T19:48:01.9718390Z 2025-05-07T19:48:01.9718396Z 2025-05-07T19:48:01.9718403Z 2025-05-07T19:48:01.9718529Z 2025-05-07T19:48:01.9718538Z 2025-05-07T19:48:01.9718543Z 2025-05-07T19:48:01.9718609Z 2025-05-07T19:48:01.9718614Z 2025-05-07T19:48:01.9718619Z 2025-05-07T19:48:01.9718625Z 2025-05-07T19:48:01.9719012Z 2025-05-07T19:48:01.9719554Z  2025-05-07T19:48:01.9719958Z 2025-05-07T19:48:01.9720190Z 2025-05-07T19:48:01.9720473Z  2025-05-07T19:48:01.9720737Z 2025-05-07T19:48:01.9720742Z 2025-05-07T19:48:01.9720996Z  2025-05-07T19:48:01.9721229Z 2025-05-07T19:48:01.9721234Z 2025-05-07T19:48:01.9721274Z 2025-05-07T19:48:01.9721504Z  2025-05-07T19:48:01.9721739Z 2025-05-07T19:48:01.9721743Z 2025-05-07T19:48:01.9721746Z 2025-05-07T19:48:01.9721750Z 2025-05-07T19:48:01.9721938Z  2025-05-07T19:48:01.9722206Z 2025-05-07T19:48:01.9722209Z 2025-05-07T19:48:01.9722213Z 2025-05-07T19:48:01.9722216Z 2025-05-07T19:48:01.9722220Z 2025-05-07T19:48:01.9722410Z  2025-05-07T19:48:01.9722663Z 2025-05-07T19:48:01.9722667Z 2025-05-07T19:48:01.9722670Z 2025-05-07T19:48:01.9722701Z 2025-05-07T19:48:01.9722705Z 2025-05-07T19:48:01.9722708Z 2025-05-07T19:48:01.9722905Z  2025-05-07T19:48:01.9723145Z 2025-05-07T19:48:01.9723148Z 2025-05-07T19:48:01.9723152Z 2025-05-07T19:48:01.9723155Z 2025-05-07T19:48:01.9723159Z 2025-05-07T19:48:01.9723453Z 2025-05-07T19:48:01.9723672Z 2025-05-07T19:48:01.9723939Z  2025-05-07T19:48:01.9724185Z 2025-05-07T19:48:01.9724188Z 2025-05-07T19:48:01.9724192Z 2025-05-07T19:48:01.9724195Z 2025-05-07T19:48:01.9724199Z 2025-05-07T19:48:01.9724202Z 2025-05-07T19:48:01.9724206Z 2025-05-07T19:48:01.9724210Z 2025-05-07T19:48:01.9724449Z  2025-05-07T19:48:01.9724699Z 2025-05-07T19:48:01.9724703Z 2025-05-07T19:48:01.9724707Z 2025-05-07T19:48:01.9724710Z 2025-05-07T19:48:01.9724713Z 2025-05-07T19:48:01.9724717Z 2025-05-07T19:48:01.9724720Z 2025-05-07T19:48:01.9724724Z 2025-05-07T19:48:01.9724727Z 2025-05-07T19:48:01.9724964Z  2025-05-07T19:48:01.9725214Z 2025-05-07T19:48:01.9725217Z 2025-05-07T19:48:01.9725221Z 2025-05-07T19:48:01.9725224Z 2025-05-07T19:48:01.9725227Z 2025-05-07T19:48:01.9725238Z 2025-05-07T19:48:01.9725247Z 2025-05-07T19:48:01.9725250Z 2025-05-07T19:48:01.9725254Z 2025-05-07T19:48:01.9725258Z 2025-05-07T19:48:01.9725493Z  2025-05-07T19:48:01.9725744Z 2025-05-07T19:48:01.9725748Z 2025-05-07T19:48:01.9725752Z 2025-05-07T19:48:01.9725755Z 2025-05-07T19:48:01.9725758Z 2025-05-07T19:48:01.9725762Z 2025-05-07T19:48:01.9725765Z 2025-05-07T19:48:01.9725769Z 2025-05-07T19:48:01.9725772Z 2025-05-07T19:48:01.9725776Z 2025-05-07T19:48:01.9725779Z 2025-05-07T19:48:01.9726029Z  2025-05-07T19:48:01.9726292Z 2025-05-07T19:48:01.9726296Z 2025-05-07T19:48:01.9726299Z 2025-05-07T19:48:01.9726303Z 2025-05-07T19:48:01.9726306Z 2025-05-07T19:48:01.9726310Z 2025-05-07T19:48:01.9726313Z 2025-05-07T19:48:01.9726317Z 2025-05-07T19:48:01.9726320Z 2025-05-07T19:48:01.9726323Z 2025-05-07T19:48:01.9726330Z 2025-05-07T19:48:01.9726339Z 2025-05-07T19:48:01.9726583Z  2025-05-07T19:48:01.9726840Z 2025-05-07T19:48:01.9726844Z 2025-05-07T19:48:01.9726847Z 2025-05-07T19:48:01.9726851Z 2025-05-07T19:48:01.9726854Z 2025-05-07T19:48:01.9726857Z 2025-05-07T19:48:01.9726861Z 2025-05-07T19:48:01.9726864Z 2025-05-07T19:48:01.9726868Z 2025-05-07T19:48:01.9726906Z 2025-05-07T19:48:01.9726910Z 2025-05-07T19:48:01.9726914Z 2025-05-07T19:48:01.9726917Z 2025-05-07T19:48:01.9727135Z  2025-05-07T19:48:01.9727393Z 2025-05-07T19:48:01.9727397Z 2025-05-07T19:48:01.9727400Z 2025-05-07T19:48:01.9727404Z 2025-05-07T19:48:01.9727407Z 2025-05-07T19:48:01.9727411Z 2025-05-07T19:48:01.9727443Z 2025-05-07T19:48:01.9727447Z 2025-05-07T19:48:01.9727450Z 2025-05-07T19:48:01.9727454Z 2025-05-07T19:48:01.9727457Z 2025-05-07T19:48:01.9727468Z 2025-05-07T19:48:01.9727475Z 2025-05-07T19:48:01.9727478Z 2025-05-07T19:48:01.9727704Z  2025-05-07T19:48:01.9727967Z 2025-05-07T19:48:01.9727970Z 2025-05-07T19:48:01.9728000Z 2025-05-07T19:48:01.9728003Z 2025-05-07T19:48:01.9728007Z 2025-05-07T19:48:01.9728010Z 2025-05-07T19:48:01.9728013Z 2025-05-07T19:48:01.9728017Z 2025-05-07T19:48:01.9728020Z 2025-05-07T19:48:01.9728023Z 2025-05-07T19:48:01.9728027Z 2025-05-07T19:48:01.9728030Z 2025-05-07T19:48:01.9728034Z 2025-05-07T19:48:01.9728037Z 2025-05-07T19:48:01.9728040Z 2025-05-07T19:48:01.9728383Z  2025-05-07T19:48:01.9728680Z 2025-05-07T19:48:01.9728683Z 2025-05-07T19:48:01.9728687Z 2025-05-07T19:48:01.9728690Z 2025-05-07T19:48:01.9728694Z 2025-05-07T19:48:01.9728697Z 2025-05-07T19:48:01.9728701Z 2025-05-07T19:48:01.9728705Z 2025-05-07T19:48:01.9728821Z 2025-05-07T19:48:01.9728885Z 2025-05-07T19:48:01.9728888Z 2025-05-07T19:48:01.9728892Z 2025-05-07T19:48:01.9728895Z 2025-05-07T19:48:01.9728899Z 2025-05-07T19:48:01.9728902Z 2025-05-07T19:48:01.9728906Z 2025-05-07T19:48:01.9729175Z  2025-05-07T19:48:01.9729445Z 2025-05-07T19:48:01.9729449Z 2025-05-07T19:48:01.9729452Z 2025-05-07T19:48:01.9729456Z 2025-05-07T19:48:01.9729460Z 2025-05-07T19:48:01.9729463Z 2025-05-07T19:48:01.9729467Z 2025-05-07T19:48:01.9729471Z 2025-05-07T19:48:01.9729474Z 2025-05-07T19:48:01.9729477Z 2025-05-07T19:48:01.9729481Z 2025-05-07T19:48:01.9729484Z 2025-05-07T19:48:01.9729488Z 2025-05-07T19:48:01.9729518Z 2025-05-07T19:48:01.9729521Z 2025-05-07T19:48:01.9729525Z 2025-05-07T19:48:01.9729528Z 2025-05-07T19:48:01.9729776Z  2025-05-07T19:48:01.9730056Z 2025-05-07T19:48:01.9730063Z 2025-05-07T19:48:01.9730067Z 2025-05-07T19:48:01.9730071Z 2025-05-07T19:48:01.9730074Z 2025-05-07T19:48:01.9730108Z 2025-05-07T19:48:01.9730111Z 2025-05-07T19:48:01.9730115Z 2025-05-07T19:48:01.9730118Z 2025-05-07T19:48:01.9730122Z 2025-05-07T19:48:01.9730125Z 2025-05-07T19:48:01.9730129Z 2025-05-07T19:48:01.9730132Z 2025-05-07T19:48:01.9730136Z 2025-05-07T19:48:01.9730139Z 2025-05-07T19:48:01.9730143Z 2025-05-07T19:48:01.9730146Z 2025-05-07T19:48:01.9730150Z 2025-05-07T19:48:01.9730392Z  2025-05-07T19:48:01.9730694Z 2025-05-07T19:48:01.9730698Z 2025-05-07T19:48:01.9730816Z  2025-05-07T19:48:01.9730934Z 2025-05-07T19:48:01.9730938Z 2025-05-07T19:48:01.9731082Z  2025-05-07T19:48:01.9731461Z 2025-05-07T19:48:01.9731465Z 2025-05-07T19:48:01.9731469Z 2025-05-07T19:48:01.9731592Z  2025-05-07T19:48:01.9731751Z 2025-05-07T19:48:01.9731760Z 2025-05-07T19:48:01.9731767Z 2025-05-07T19:48:01.9731770Z 2025-05-07T19:48:01.9731890Z  2025-05-07T19:48:01.9732024Z 2025-05-07T19:48:01.9732028Z 2025-05-07T19:48:01.9732031Z 2025-05-07T19:48:01.9732035Z 2025-05-07T19:48:01.9732039Z 2025-05-07T19:48:01.9732190Z  2025-05-07T19:48:01.9732329Z 2025-05-07T19:48:01.9732333Z 2025-05-07T19:48:01.9732337Z 2025-05-07T19:48:01.9732341Z 2025-05-07T19:48:01.9732344Z 2025-05-07T19:48:01.9732348Z 2025-05-07T19:48:01.9732474Z  2025-05-07T19:48:01.9732658Z 2025-05-07T19:48:01.9732662Z 2025-05-07T19:48:01.9732665Z 2025-05-07T19:48:01.9732669Z 2025-05-07T19:48:01.9732673Z 2025-05-07T19:48:01.9732676Z 2025-05-07T19:48:01.9732680Z 2025-05-07T19:48:01.9732810Z  2025-05-07T19:48:01.9732997Z 2025-05-07T19:48:01.9733001Z 2025-05-07T19:48:01.9733004Z 2025-05-07T19:48:01.9733008Z 2025-05-07T19:48:01.9733012Z 2025-05-07T19:48:01.9733015Z 2025-05-07T19:48:01.9733018Z 2025-05-07T19:48:01.9733027Z 2025-05-07T19:48:01.9733166Z  2025-05-07T19:48:01.9733340Z 2025-05-07T19:48:01.9733371Z 2025-05-07T19:48:01.9733374Z 2025-05-07T19:48:01.9733378Z 2025-05-07T19:48:01.9733381Z 2025-05-07T19:48:01.9733385Z 2025-05-07T19:48:01.9733388Z 2025-05-07T19:48:01.9733392Z 2025-05-07T19:48:01.9733395Z 2025-05-07T19:48:01.9733525Z  2025-05-07T19:48:01.9733697Z 2025-05-07T19:48:01.9733701Z 2025-05-07T19:48:01.9733705Z 2025-05-07T19:48:01.9733732Z 2025-05-07T19:48:01.9733736Z 2025-05-07T19:48:01.9733739Z 2025-05-07T19:48:01.9733743Z 2025-05-07T19:48:01.9733746Z 2025-05-07T19:48:01.9733749Z 2025-05-07T19:48:01.9733753Z 2025-05-07T19:48:01.9733900Z  2025-05-07T19:48:01.9734085Z 2025-05-07T19:48:01.9734088Z 2025-05-07T19:48:01.9734092Z 2025-05-07T19:48:01.9734096Z 2025-05-07T19:48:01.9734129Z 2025-05-07T19:48:01.9734132Z 2025-05-07T19:48:01.9734135Z 2025-05-07T19:48:01.9734139Z 2025-05-07T19:48:01.9734142Z 2025-05-07T19:48:01.9734256Z 2025-05-07T19:48:01.9734353Z 2025-05-07T19:48:01.9734498Z  2025-05-07T19:48:01.9734693Z 2025-05-07T19:48:01.9734697Z 2025-05-07T19:48:01.9734701Z 2025-05-07T19:48:01.9734705Z 2025-05-07T19:48:01.9734735Z 2025-05-07T19:48:01.9734738Z 2025-05-07T19:48:01.9734742Z 2025-05-07T19:48:01.9734745Z 2025-05-07T19:48:01.9734748Z 2025-05-07T19:48:01.9734752Z 2025-05-07T19:48:01.9734755Z 2025-05-07T19:48:01.9734759Z 2025-05-07T19:48:01.9734902Z  2025-05-07T19:48:01.9735108Z 2025-05-07T19:48:01.9735112Z 2025-05-07T19:48:01.9735115Z 2025-05-07T19:48:01.9735143Z 2025-05-07T19:48:01.9735147Z 2025-05-07T19:48:01.9735150Z 2025-05-07T19:48:01.9735153Z 2025-05-07T19:48:01.9735157Z 2025-05-07T19:48:01.9735160Z 2025-05-07T19:48:01.9735164Z 2025-05-07T19:48:01.9735167Z 2025-05-07T19:48:01.9735170Z 2025-05-07T19:48:01.9735174Z 2025-05-07T19:48:01.9735325Z  2025-05-07T19:48:01.9735568Z 2025-05-07T19:48:01.9735575Z 2025-05-07T19:48:01.9735578Z 2025-05-07T19:48:01.9735582Z 2025-05-07T19:48:01.9735585Z 2025-05-07T19:48:01.9735588Z 2025-05-07T19:48:01.9735592Z 2025-05-07T19:48:01.9735596Z 2025-05-07T19:48:01.9735599Z 2025-05-07T19:48:01.9735603Z 2025-05-07T19:48:01.9735607Z 2025-05-07T19:48:01.9735610Z 2025-05-07T19:48:01.9735613Z 2025-05-07T19:48:01.9735617Z 2025-05-07T19:48:01.9735769Z  2025-05-07T19:48:01.9736010Z 2025-05-07T19:48:01.9736014Z 2025-05-07T19:48:01.9736018Z 2025-05-07T19:48:01.9736021Z 2025-05-07T19:48:01.9736025Z 2025-05-07T19:48:01.9736028Z 2025-05-07T19:48:01.9736032Z 2025-05-07T19:48:01.9736035Z 2025-05-07T19:48:01.9736038Z 2025-05-07T19:48:01.9736042Z 2025-05-07T19:48:01.9736045Z 2025-05-07T19:48:01.9736048Z 2025-05-07T19:48:01.9736052Z 2025-05-07T19:48:01.9736055Z 2025-05-07T19:48:01.9736059Z 2025-05-07T19:48:01.9736251Z  2025-05-07T19:48:01.9736471Z 2025-05-07T19:48:01.9736478Z 2025-05-07T19:48:01.9736485Z 2025-05-07T19:48:01.9736489Z 2025-05-07T19:48:01.9736492Z 2025-05-07T19:48:01.9736496Z 2025-05-07T19:48:01.9736499Z 2025-05-07T19:48:01.9736502Z 2025-05-07T19:48:01.9736506Z 2025-05-07T19:48:01.9736509Z 2025-05-07T19:48:01.9736513Z 2025-05-07T19:48:01.9736516Z 2025-05-07T19:48:01.9736520Z 2025-05-07T19:48:01.9736523Z 2025-05-07T19:48:01.9736526Z 2025-05-07T19:48:01.9736530Z 2025-05-07T19:48:01.9736723Z  2025-05-07T19:48:01.9736947Z 2025-05-07T19:48:01.9736951Z 2025-05-07T19:48:01.9736954Z 2025-05-07T19:48:01.9736958Z 2025-05-07T19:48:01.9736961Z 2025-05-07T19:48:01.9736965Z 2025-05-07T19:48:01.9736968Z 2025-05-07T19:48:01.9736972Z 2025-05-07T19:48:01.9736975Z 2025-05-07T19:48:01.9736978Z 2025-05-07T19:48:01.9736982Z 2025-05-07T19:48:01.9737010Z 2025-05-07T19:48:01.9737013Z 2025-05-07T19:48:01.9737017Z 2025-05-07T19:48:01.9737020Z 2025-05-07T19:48:01.9737024Z 2025-05-07T19:48:01.9737030Z 2025-05-07T19:48:01.9737204Z  2025-05-07T19:48:01.9737435Z 2025-05-07T19:48:01.9737439Z 2025-05-07T19:48:01.9737442Z 2025-05-07T19:48:01.9737446Z 2025-05-07T19:48:01.9737473Z 2025-05-07T19:48:01.9737477Z 2025-05-07T19:48:01.9737480Z 2025-05-07T19:48:01.9737484Z 2025-05-07T19:48:01.9737487Z 2025-05-07T19:48:01.9737491Z 2025-05-07T19:48:01.9737494Z 2025-05-07T19:48:01.9737498Z 2025-05-07T19:48:01.9737501Z 2025-05-07T19:48:01.9737504Z 2025-05-07T19:48:01.9737508Z 2025-05-07T19:48:01.9737511Z 2025-05-07T19:48:01.9737515Z 2025-05-07T19:48:01.9737518Z 2025-05-07T19:48:01.9737698Z  2025-05-07T19:48:01.9738066Z 2025-05-07T19:48:01.9738069Z 2025-05-07T19:48:01.9738172Z  2025-05-07T19:48:01.9738281Z 2025-05-07T19:48:01.9738284Z 2025-05-07T19:48:01.9738410Z  2025-05-07T19:48:01.9738524Z 2025-05-07T19:48:01.9738528Z 2025-05-07T19:48:01.9738531Z 2025-05-07T19:48:01.9738694Z  2025-05-07T19:48:01.9738894Z 2025-05-07T19:48:01.9738898Z 2025-05-07T19:48:01.9738901Z 2025-05-07T19:48:01.9738904Z 2025-05-07T19:48:01.9739013Z  2025-05-07T19:48:01.9739141Z 2025-05-07T19:48:01.9739144Z 2025-05-07T19:48:01.9739147Z 2025-05-07T19:48:01.9739150Z 2025-05-07T19:48:01.9739153Z 2025-05-07T19:48:01.9739291Z  2025-05-07T19:48:01.9739419Z 2025-05-07T19:48:01.9739423Z 2025-05-07T19:48:01.9739426Z 2025-05-07T19:48:01.9739429Z 2025-05-07T19:48:01.9739432Z 2025-05-07T19:48:01.9739436Z 2025-05-07T19:48:01.9739546Z  2025-05-07T19:48:01.9739707Z 2025-05-07T19:48:01.9739710Z 2025-05-07T19:48:01.9739713Z 2025-05-07T19:48:01.9739716Z 2025-05-07T19:48:01.9739719Z 2025-05-07T19:48:01.9739722Z 2025-05-07T19:48:01.9739726Z 2025-05-07T19:48:01.9739842Z  2025-05-07T19:48:01.9740016Z 2025-05-07T19:48:01.9740019Z 2025-05-07T19:48:01.9740023Z 2025-05-07T19:48:01.9740026Z 2025-05-07T19:48:01.9740029Z 2025-05-07T19:48:01.9740035Z 2025-05-07T19:48:01.9740041Z 2025-05-07T19:48:01.9740044Z 2025-05-07T19:48:01.9740170Z  2025-05-07T19:48:01.9740326Z 2025-05-07T19:48:01.9740358Z 2025-05-07T19:48:01.9740361Z 2025-05-07T19:48:01.9740364Z 2025-05-07T19:48:01.9740367Z 2025-05-07T19:48:01.9740371Z 2025-05-07T19:48:01.9740374Z 2025-05-07T19:48:01.9740377Z 2025-05-07T19:48:01.9740380Z 2025-05-07T19:48:01.9740505Z  2025-05-07T19:48:01.9740668Z 2025-05-07T19:48:01.9740671Z 2025-05-07T19:48:01.9740675Z 2025-05-07T19:48:01.9740707Z 2025-05-07T19:48:01.9740711Z 2025-05-07T19:48:01.9740714Z 2025-05-07T19:48:01.9740717Z 2025-05-07T19:48:01.9740720Z 2025-05-07T19:48:01.9740723Z 2025-05-07T19:48:01.9740726Z 2025-05-07T19:48:01.9740856Z  2025-05-07T19:48:01.9741029Z 2025-05-07T19:48:01.9741033Z 2025-05-07T19:48:01.9741036Z 2025-05-07T19:48:01.9741039Z 2025-05-07T19:48:01.9741072Z 2025-05-07T19:48:01.9741075Z 2025-05-07T19:48:01.9741078Z 2025-05-07T19:48:01.9741088Z 2025-05-07T19:48:01.9741091Z 2025-05-07T19:48:01.9741094Z 2025-05-07T19:48:01.9741097Z 2025-05-07T19:48:01.9741230Z  2025-05-07T19:48:01.9741412Z 2025-05-07T19:48:01.9741415Z 2025-05-07T19:48:01.9741418Z 2025-05-07T19:48:01.9741422Z 2025-05-07T19:48:01.9741452Z 2025-05-07T19:48:01.9741455Z 2025-05-07T19:48:01.9741459Z 2025-05-07T19:48:01.9741462Z 2025-05-07T19:48:01.9741465Z 2025-05-07T19:48:01.9741468Z 2025-05-07T19:48:01.9741471Z 2025-05-07T19:48:01.9741475Z 2025-05-07T19:48:01.9741628Z  done 2025-05-07T19:48:02.0738258Z Preparing transaction: / done 2025-05-07T19:48:02.2747232Z Verifying transaction: \ | done 2025-05-07T19:48:02.4777812Z Executing transaction: - \ done 2025-05-07T19:48:04.4919789Z [INSTALL] Appending libcuda.so path to LD_LIBRARY_PATH ... 2025-05-07T19:48:04.5285079Z [ENV] Appending to LD_LIBRARY_PATH: /github/home/miniconda/envs/build_binary/lib/stubs ... 2025-05-07T19:48:06.3754422Z + conda env config vars set -n build_binary LD_LIBRARY_PATH=/github/home/miniconda/envs/build_binary/lib:/github/home/miniconda/envs/build_binary/lib/stubs 2025-05-07T19:48:06.3755774Z 2025-05-07T19:48:06.7818354Z 2025-05-07T19:48:06.7820788Z [INSTALL] Setting environment variable NVML_LIB_PATH ... 2025-05-07T19:48:06.8160699Z + conda env config vars set -n build_binary NVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:48:06.8161439Z 2025-05-07T19:48:07.2303209Z 2025-05-07T19:48:07.2304114Z [INSTALL] Setting environment variable CUDA_INCLUDE_DIRS ... 2025-05-07T19:48:07.2306890Z + conda env config vars set -n build_binary CUDA_INCLUDE_DIRS="/github/home/miniconda/envs/build_binary/include/:/github/home/miniconda/envs/build_binary/targets/x86_64-linux/include/" 2025-05-07T19:48:07.2307712Z 2025-05-07T19:48:07.6368601Z 2025-05-07T19:48:09.5791156Z [CHECK] cuda_runtime.h found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/include/cuda_runtime.h 2025-05-07T19:48:11.5655279Z [CHECK] libcuda.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:48:13.5041813Z [CHECK] libnvToolsExt.so found in CONDA_PREFIX PATH (symbolic link): /github/home/miniconda/envs/build_binary/lib/libnvToolsExt.so 2025-05-07T19:48:15.4551330Z [CHECK] libnvidia-ml.so found in CONDA_PREFIX PATH (file): /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:48:17.2734649Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:48:17.2735016Z 2025-05-07T19:48:17.3307778Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:48:21.1266655Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:48:21.1267385Z Target: x86_64-conda-linux-gnu 2025-05-07T19:48:21.1267728Z Thread model: posix 2025-05-07T19:48:21.1268071Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:48:21.1268819Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang.cfg 2025-05-07T19:48:21.1269320Z 2025-05-07T19:48:21.1836496Z [INSTALL] Resetting compiler symlinks to clang ... 2025-05-07T19:48:24.9378705Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:48:24.9379303Z 2025-05-07T19:48:24.9395835Z 2025-05-07T19:48:24.9421857Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:48:24.9423397Z 2025-05-07T19:48:24.9433975Z 2025-05-07T19:48:24.9455891Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:48:24.9456449Z 2025-05-07T19:48:24.9474691Z 2025-05-07T19:48:24.9506777Z + ln -sf /github/home/miniconda/envs/build_binary/bin/clang++ /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:48:24.9507316Z 2025-05-07T19:48:24.9517952Z 2025-05-07T19:48:24.9518698Z + ls -la /github/home/miniconda/envs/build_binary/etc/conda/activate.d 2025-05-07T19:48:24.9519689Z 2025-05-07T19:48:24.9536778Z total 36 2025-05-07T19:48:24.9537499Z drwxr-xr-x. 2 root root 188 May 7 19:45 . 2025-05-07T19:48:24.9538332Z drwxr-xr-x. 5 root root 62 May 7 19:44 .. 2025-05-07T19:48:24.9538777Z -rw-r--r--. 2 root root 3778 Jun 10 2024 activate-binutils_linux-64.sh 2025-05-07T19:48:24.9539291Z -rw-r--r--. 2 root root 11630 Jun 10 2024 activate-gcc_linux-64.sh 2025-05-07T19:48:24.9539761Z -rw-r--r--. 2 root root 5190 Jun 10 2024 activate-gxx_linux-64.sh 2025-05-07T19:48:24.9540219Z -rw-r--r--. 2 root root 136 Mar 27 01:27 libglib_activate.sh 2025-05-07T19:48:24.9540642Z -rw-r--r--. 2 root root 873 Jun 5 2024 libxml2_activate.sh 2025-05-07T19:48:24.9541089Z -rw-r--r--. 2 root root 499 Nov 30 04:26 openjdk_activate.sh 2025-05-07T19:48:24.9541356Z 2025-05-07T19:48:24.9541533Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:48:24.9541803Z 2025-05-07T19:48:26.8756659Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:48:26.8759442Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:48:26.8759953Z 2025-05-07T19:48:26.8760112Z [BUILD] Setting Clang as the NVCC host compiler: 2025-05-07T19:48:28.7476389Z [BUILD] Setting prepend flags for NVCC ... 2025-05-07T19:48:28.7478847Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++" 2025-05-07T19:48:28.7479637Z 2025-05-07T19:48:29.1579446Z 2025-05-07T19:48:29.1579821Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:48:29.1580140Z 2025-05-07T19:48:30.9958485Z -allow-unsupported-compiler -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:48:30.9959136Z 2025-05-07T19:48:31.0710834Z 2025-05-07T19:48:31.0711686Z [INFO] Printing out all preprocessor defines in nvcc ... 2025-05-07T19:48:31.0712456Z + conda run -n build_binary nvcc --compiler-options -dM -E -x cu - < /dev/null 2025-05-07T19:48:31.0712835Z 2025-05-07T19:48:32.9553781Z #define ADJ_ESTERROR 0x0008 2025-05-07T19:48:32.9554775Z #define ADJ_FREQUENCY 0x0002 2025-05-07T19:48:32.9555578Z #define ADJ_MAXERROR 0x0004 2025-05-07T19:48:32.9556381Z #define ADJ_MICRO 0x1000 2025-05-07T19:48:32.9557148Z #define ADJ_NANO 0x2000 2025-05-07T19:48:32.9557857Z #define ADJ_OFFSET 0x0001 2025-05-07T19:48:32.9558679Z #define ADJ_OFFSET_SINGLESHOT 0x8001 2025-05-07T19:48:32.9559509Z #define ADJ_OFFSET_SS_READ 0xa001 2025-05-07T19:48:32.9559836Z #define ADJ_STATUS 0x0010 2025-05-07T19:48:32.9560101Z #define ADJ_TAI 0x0080 2025-05-07T19:48:32.9560387Z #define ADJ_TICK 0x4000 2025-05-07T19:48:32.9560657Z #define ADJ_TIMECONST 0x0020 2025-05-07T19:48:32.9560993Z #define AIO_PRIO_DELTA_MAX 20 2025-05-07T19:48:32.9561335Z #define BC_BASE_MAX _POSIX2_BC_BASE_MAX 2025-05-07T19:48:32.9561723Z #define BC_DIM_MAX _POSIX2_BC_DIM_MAX 2025-05-07T19:48:32.9562077Z #define BC_SCALE_MAX _POSIX2_BC_SCALE_MAX 2025-05-07T19:48:32.9562429Z #define BC_STRING_MAX _POSIX2_BC_STRING_MAX 2025-05-07T19:48:32.9562794Z #define BIG_ENDIAN __BIG_ENDIAN 2025-05-07T19:48:32.9563095Z #define BUFSIZ _IO_BUFSIZ 2025-05-07T19:48:32.9563405Z #define BYTE_ORDER __BYTE_ORDER 2025-05-07T19:48:32.9563693Z #define CHARCLASS_NAME_MAX 2048 2025-05-07T19:48:32.9564008Z #define CHAR_BIT __CHAR_BIT__ 2025-05-07T19:48:32.9564297Z #define CHAR_MAX __SCHAR_MAX__ 2025-05-07T19:48:32.9564609Z #define CHAR_MIN SCHAR_MIN 2025-05-07T19:48:32.9564901Z #define CLOCKS_PER_SEC 1000000l 2025-05-07T19:48:32.9565239Z #define CLOCK_BOOTTIME 7 2025-05-07T19:48:32.9565546Z #define CLOCK_BOOTTIME_ALARM 9 2025-05-07T19:48:32.9565837Z #define CLOCK_MONOTONIC 1 2025-05-07T19:48:32.9566136Z #define CLOCK_MONOTONIC_COARSE 6 2025-05-07T19:48:32.9566438Z #define CLOCK_MONOTONIC_RAW 4 2025-05-07T19:48:32.9566778Z #define CLOCK_PROCESS_CPUTIME_ID 2 2025-05-07T19:48:32.9567099Z #define CLOCK_REALTIME 0 2025-05-07T19:48:32.9567414Z #define CLOCK_REALTIME_ALARM 8 2025-05-07T19:48:32.9567711Z #define CLOCK_REALTIME_COARSE 5 2025-05-07T19:48:32.9568038Z #define CLOCK_TAI 11 2025-05-07T19:48:32.9568309Z #define CLOCK_THREAD_CPUTIME_ID 3 2025-05-07T19:48:32.9568654Z #define COLL_WEIGHTS_MAX 255 2025-05-07T19:48:32.9568955Z #define CUDARTAPI 2025-05-07T19:48:32.9569205Z #define CUDARTAPI_CDECL 2025-05-07T19:48:32.9569670Z #define CUDART_CB 2025-05-07T19:48:32.9569925Z #define CUDART_DEVICE __device__ 2025-05-07T19:48:32.9570265Z #define CUDART_VERSION 11080 2025-05-07T19:48:32.9570574Z #define CUDA_DOUBLE_MATH_FUNCTIONS 1 2025-05-07T19:48:32.9570929Z #define CUDA_IPC_HANDLE_SIZE 64 2025-05-07T19:48:32.9571238Z #define CU_UUID_HAS_BEEN_DEFINED 2025-05-07T19:48:32.9571574Z #define DELAYTIMER_MAX 2147483647 2025-05-07T19:48:32.9571876Z #define DOMAIN 1 2025-05-07T19:48:32.9572155Z #define EOF (-1) 2025-05-07T19:48:32.9572454Z #define EXIT_FAILURE 1 2025-05-07T19:48:32.9572726Z #define EXIT_SUCCESS 0 2025-05-07T19:48:32.9573038Z #define EXPR_NEST_MAX _POSIX2_EXPR_NEST_MAX 2025-05-07T19:48:32.9573421Z #define FD_CLR(fd,fdsetp) __FD_CLR (fd, fdsetp) 2025-05-07T19:48:32.9573844Z #define FD_ISSET(fd,fdsetp) __FD_ISSET (fd, fdsetp) 2025-05-07T19:48:32.9574250Z #define FD_SET(fd,fdsetp) __FD_SET (fd, fdsetp) 2025-05-07T19:48:32.9574636Z #define FD_SETSIZE __FD_SETSIZE 2025-05-07T19:48:32.9575069Z #define FD_ZERO(fdsetp) __FD_ZERO (fdsetp) 2025-05-07T19:48:32.9575421Z #define FILENAME_MAX 4096 2025-05-07T19:48:32.9575712Z #define FOPEN_MAX 16 2025-05-07T19:48:32.9579322Z #define FP_ILOGB0 (-2147483647 - 1) 2025-05-07T19:48:32.9579677Z #define FP_ILOGBNAN (-2147483647 - 1) 2025-05-07T19:48:32.9579983Z #define FP_INFINITE 1 2025-05-07T19:48:32.9580272Z #define FP_NAN 0 2025-05-07T19:48:32.9580516Z #define FP_NORMAL 4 2025-05-07T19:48:32.9580968Z #define FP_SUBNORMAL 3 2025-05-07T19:48:32.9581225Z #define FP_ZERO 2 2025-05-07T19:48:32.9583321Z #define HOST_NAME_MAX 64 2025-05-07T19:48:32.9583801Z #define HUGE 3.40282347e+38F 2025-05-07T19:48:32.9584125Z #define HUGE_VAL (__builtin_huge_val()) 2025-05-07T19:48:32.9584499Z #define HUGE_VALF (__builtin_huge_valf()) 2025-05-07T19:48:32.9584851Z #define HUGE_VALL (__builtin_huge_vall()) 2025-05-07T19:48:32.9585220Z #define INFINITY (__builtin_inff()) 2025-05-07T19:48:32.9585535Z #define INT_MAX __INT_MAX__ 2025-05-07T19:48:32.9585868Z #define INT_MIN (-__INT_MAX__ -1) 2025-05-07T19:48:32.9586170Z #define IOV_MAX 1024 2025-05-07T19:48:32.9586470Z #define LINE_MAX _POSIX2_LINE_MAX 2025-05-07T19:48:32.9586791Z #define LITTLE_ENDIAN __LITTLE_ENDIAN 2025-05-07T19:48:32.9587138Z #define LLONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:48:32.9587475Z #define LLONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:48:32.9587959Z #define LOGIN_NAME_MAX 256 2025-05-07T19:48:32.9588267Z #define LONG_BIT 64 2025-05-07T19:48:32.9588531Z #define LONG_LONG_MAX __LONG_LONG_MAX__ 2025-05-07T19:48:32.9588910Z #define LONG_LONG_MIN (-__LONG_LONG_MAX__-1LL) 2025-05-07T19:48:32.9589257Z #define LONG_MAX __LONG_MAX__ 2025-05-07T19:48:32.9589584Z #define LONG_MIN (-__LONG_MAX__ -1L) 2025-05-07T19:48:32.9589876Z #define L_ctermid 9 2025-05-07T19:48:32.9590138Z #define L_cuserid 9 2025-05-07T19:48:32.9590377Z #define L_tmpnam 20 2025-05-07T19:48:32.9590645Z #define MATH_ERREXCEPT 2 2025-05-07T19:48:32.9590915Z #define MATH_ERRNO 1 2025-05-07T19:48:32.9591188Z #define MAX_CANON 255 2025-05-07T19:48:32.9591475Z #define MAX_INPUT 255 2025-05-07T19:48:32.9591760Z #define MB_CUR_MAX (__ctype_get_mb_cur_max ()) 2025-05-07T19:48:32.9592119Z #define MB_LEN_MAX 16 2025-05-07T19:48:32.9592531Z #define MOD_CLKA ADJ_OFFSET_SINGLESHOT 2025-05-07T19:48:32.9593064Z #define MOD_CLKB ADJ_TICK 2025-05-07T19:48:32.9593355Z #define MOD_ESTERROR ADJ_ESTERROR 2025-05-07T19:48:32.9593692Z #define MOD_FREQUENCY ADJ_FREQUENCY 2025-05-07T19:48:32.9594009Z #define MOD_MAXERROR ADJ_MAXERROR 2025-05-07T19:48:32.9594341Z #define MOD_MICRO ADJ_MICRO 2025-05-07T19:48:32.9594628Z #define MOD_NANO ADJ_NANO 2025-05-07T19:48:32.9594937Z #define MOD_OFFSET ADJ_OFFSET 2025-05-07T19:48:32.9595258Z #define MOD_STATUS ADJ_STATUS 2025-05-07T19:48:32.9595540Z #define MOD_TAI ADJ_TAI 2025-05-07T19:48:32.9595844Z #define MOD_TIMECONST ADJ_TIMECONST 2025-05-07T19:48:32.9596151Z #define MQ_PRIO_MAX 32768 2025-05-07T19:48:32.9596454Z #define M_1_PI 0.31830988618379067154 2025-05-07T19:48:32.9596801Z #define M_1_PIl 0.318309886183790671537767526745028724L 2025-05-07T19:48:32.9597189Z #define M_2_PI 0.63661977236758134308 2025-05-07T19:48:32.9597530Z #define M_2_PIl 0.636619772367581343075535053490057448L 2025-05-07T19:48:32.9597920Z #define M_2_SQRTPI 1.12837916709551257390 2025-05-07T19:48:32.9598303Z #define M_2_SQRTPIl 1.128379167095512573896158903121545172L 2025-05-07T19:48:32.9598715Z #define M_E 2.7182818284590452354 2025-05-07T19:48:32.9599077Z #define M_El 2.718281828459045235360287471352662498L 2025-05-07T19:48:32.9599428Z #define M_LN10 2.30258509299404568402 2025-05-07T19:48:32.9599804Z #define M_LN10l 2.302585092994045684017991454684364208L 2025-05-07T19:48:32.9600163Z #define M_LN2 0.69314718055994530942 2025-05-07T19:48:32.9600525Z #define M_LN2l 0.693147180559945309417232121458176568L 2025-05-07T19:48:32.9600884Z #define M_LOG10E 0.43429448190325182765 2025-05-07T19:48:32.9601267Z #define M_LOG10El 0.434294481903251827651128918916605082L 2025-05-07T19:48:32.9601635Z #define M_LOG2E 1.4426950408889634074 2025-05-07T19:48:32.9602011Z #define M_LOG2El 1.442695040888963407359924681001892137L 2025-05-07T19:48:32.9602396Z #define M_PI 3.14159265358979323846 2025-05-07T19:48:32.9602703Z #define M_PI_2 1.57079632679489661923 2025-05-07T19:48:32.9603072Z #define M_PI_2l 1.570796326794896619231321691639751442L 2025-05-07T19:48:32.9603431Z #define M_PI_4 0.78539816339744830962 2025-05-07T19:48:32.9603803Z #define M_PI_4l 0.785398163397448309615660845819875721L 2025-05-07T19:48:32.9604193Z #define M_PIl 3.141592653589793238462643383279502884L 2025-05-07T19:48:32.9604680Z #define M_SQRT1_2 0.70710678118654752440 2025-05-07T19:48:32.9605246Z #define M_SQRT1_2l 0.707106781186547524400844362104849039L 2025-05-07T19:48:32.9605644Z #define M_SQRT2 1.41421356237309504880 2025-05-07T19:48:32.9606030Z #define M_SQRT2l 1.414213562373095048801688724209698079L 2025-05-07T19:48:32.9606381Z #define NAME_MAX 255 2025-05-07T19:48:32.9606676Z #define NAN (__builtin_nanf ("")) 2025-05-07T19:48:32.9606976Z #define NFDBITS __NFDBITS 2025-05-07T19:48:32.9607274Z #define NGROUPS_MAX 65536 2025-05-07T19:48:32.9607551Z #define NL_ARGMAX _POSIX_ARG_MAX 2025-05-07T19:48:32.9607879Z #define NL_LANGMAX _POSIX2_LINE_MAX 2025-05-07T19:48:32.9608184Z #define NL_MSGMAX INT_MAX 2025-05-07T19:48:32.9608485Z #define NL_NMAX INT_MAX 2025-05-07T19:48:32.9608774Z #define NL_SETMAX INT_MAX 2025-05-07T19:48:32.9609045Z #define NL_TEXTMAX INT_MAX 2025-05-07T19:48:32.9609338Z #define NULL __null 2025-05-07T19:48:32.9609578Z #define NZERO 20 2025-05-07T19:48:32.9609846Z #define OVERFLOW 3 2025-05-07T19:48:32.9610086Z #define PATH_MAX 4096 2025-05-07T19:48:32.9610387Z #define PDP_ENDIAN __PDP_ENDIAN 2025-05-07T19:48:32.9610726Z #define PIPE_BUF 4096 2025-05-07T19:48:32.9610981Z #define PLOSS 6 2025-05-07T19:48:32.9611390Z #define PTHREAD_DESTRUCTOR_ITERATIONS _POSIX_THREAD_DESTRUCTOR_ITERATIONS 2025-05-07T19:48:32.9611844Z #define PTHREAD_KEYS_MAX 1024 2025-05-07T19:48:32.9612164Z #define PTHREAD_STACK_MIN 16384 2025-05-07T19:48:32.9612455Z #define P_tmpdir "/tmp" 2025-05-07T19:48:32.9612742Z #define RAND_MAX 2147483647 2025-05-07T19:48:32.9613021Z #define RE_DUP_MAX (0x7fff) 2025-05-07T19:48:32.9613328Z #define RTSIG_MAX 32 2025-05-07T19:48:32.9613589Z #define SCHAR_MAX __SCHAR_MAX__ 2025-05-07T19:48:32.9613916Z #define SCHAR_MIN (-__SCHAR_MAX__-1) 2025-05-07T19:48:32.9614250Z #define SEEK_CUR 1 2025-05-07T19:48:32.9614496Z #define SEEK_DATA 3 2025-05-07T19:48:32.9614767Z #define SEEK_END 2 2025-05-07T19:48:32.9615012Z #define SEEK_HOLE 4 2025-05-07T19:48:32.9615285Z #define SEEK_SET 0 2025-05-07T19:48:32.9615543Z #define SEM_VALUE_MAX (2147483647) 2025-05-07T19:48:32.9615891Z #define SHRT_MAX __SHRT_MAX__ 2025-05-07T19:48:32.9616194Z #define SHRT_MIN (-__SHRT_MAX__ -1) 2025-05-07T19:48:32.9616532Z #define SING 2 2025-05-07T19:48:32.9616779Z #define SSIZE_MAX LONG_MAX 2025-05-07T19:48:32.9617091Z #define STA_CLK 0x8000 2025-05-07T19:48:32.9617360Z #define STA_CLOCKERR 0x1000 2025-05-07T19:48:32.9617672Z #define STA_DEL 0x0020 2025-05-07T19:48:32.9617968Z #define STA_FLL 0x0008 2025-05-07T19:48:32.9618224Z #define STA_FREQHOLD 0x0080 2025-05-07T19:48:32.9618490Z #define STA_INS 0x0010 2025-05-07T19:48:32.9618725Z #define STA_MODE 0x4000 2025-05-07T19:48:32.9618979Z #define STA_NANO 0x2000 2025-05-07T19:48:32.9619218Z #define STA_PLL 0x0001 2025-05-07T19:48:32.9619480Z #define STA_PPSERROR 0x0800 2025-05-07T19:48:32.9619739Z #define STA_PPSFREQ 0x0002 2025-05-07T19:48:32.9620018Z #define STA_PPSJITTER 0x0200 2025-05-07T19:48:32.9620287Z #define STA_PPSSIGNAL 0x0100 2025-05-07T19:48:32.9620569Z #define STA_PPSTIME 0x0004 2025-05-07T19:48:32.9620848Z #define STA_PPSWANDER 0x0400 2025-05-07T19:48:32.9621421Z #define STA_RONLY (STA_PPSSIGNAL | STA_PPSJITTER | STA_PPSWANDER | STA_PPSERROR | STA_CLOCKERR | STA_NANO | STA_MODE | STA_CLK) 2025-05-07T19:48:32.9622035Z #define STA_UNSYNC 0x0040 2025-05-07T19:48:32.9622289Z #define TIMER_ABSTIME 1 2025-05-07T19:48:32.9622547Z #define TIME_UTC 1 2025-05-07T19:48:32.9622771Z #define TLOSS 5 2025-05-07T19:48:32.9623027Z #define TMP_MAX 238328 2025-05-07T19:48:32.9623285Z #define TTY_NAME_MAX 32 2025-05-07T19:48:32.9623588Z #define UCHAR_MAX (__SCHAR_MAX__*2 +1) 2025-05-07T19:48:32.9623909Z #define UINT_MAX (__INT_MAX__ *2U +1U) 2025-05-07T19:48:32.9624273Z #define ULLONG_MAX (__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:48:32.9624690Z #define ULONG_LONG_MAX (__LONG_LONG_MAX__*2ULL+1ULL) 2025-05-07T19:48:32.9625056Z #define ULONG_MAX (__LONG_MAX__ *2UL+1UL) 2025-05-07T19:48:32.9625399Z #define UNDERFLOW 4 2025-05-07T19:48:32.9625659Z #define USHRT_MAX (__SHRT_MAX__ *2 +1) 2025-05-07T19:48:32.9625983Z #define WCONTINUED 8 2025-05-07T19:48:32.9626331Z #define WEXITED 4 2025-05-07T19:48:32.9626781Z #define WEXITSTATUS(status) __WEXITSTATUS (__WAIT_INT (status)) 2025-05-07T19:48:32.9627295Z #define WIFCONTINUED(status) __WIFCONTINUED (__WAIT_INT (status)) 2025-05-07T19:48:32.9627813Z #define WIFEXITED(status) __WIFEXITED (__WAIT_INT (status)) 2025-05-07T19:48:32.9628320Z #define WIFSIGNALED(status) __WIFSIGNALED (__WAIT_INT (status)) 2025-05-07T19:48:32.9628814Z #define WIFSTOPPED(status) __WIFSTOPPED (__WAIT_INT (status)) 2025-05-07T19:48:32.9629233Z #define WNOHANG 1 2025-05-07T19:48:32.9629482Z #define WNOWAIT 0x01000000 2025-05-07T19:48:32.9629778Z #define WORD_BIT 32 2025-05-07T19:48:32.9630026Z #define WSTOPPED 2 2025-05-07T19:48:32.9630359Z #define WSTOPSIG(status) __WSTOPSIG (__WAIT_INT (status)) 2025-05-07T19:48:32.9630801Z #define WTERMSIG(status) __WTERMSIG (__WAIT_INT (status)) 2025-05-07T19:48:32.9631197Z #define WUNTRACED 2 2025-05-07T19:48:32.9631887Z #define XATTR_LIST_MAX 65536 2025-05-07T19:48:32.9632214Z #define XATTR_NAME_MAX 255 2025-05-07T19:48:32.9632605Z #define XATTR_SIZE_MAX 65536 2025-05-07T19:48:32.9632907Z #define X_TLOSS 1.41484755040568800000e+16 2025-05-07T19:48:32.9633253Z #define _ACRTIMP 2025-05-07T19:48:32.9633496Z #define _ALLOCA_H 1 2025-05-07T19:48:32.9633770Z #define _ASSERT_H 1 2025-05-07T19:48:32.9634024Z #define _ATFILE_SOURCE 1 2025-05-07T19:48:32.9634328Z #define _BITS_BYTESWAP_H 1 2025-05-07T19:48:32.9634609Z #define _BITS_POSIX1_LIM_H 1 2025-05-07T19:48:32.9634917Z #define _BITS_POSIX2_LIM_H 1 2025-05-07T19:48:32.9635208Z #define _BITS_PTHREADTYPES_H 1 2025-05-07T19:48:32.9635517Z #define _BITS_TIMEX_H 1 2025-05-07T19:48:32.9635796Z #define _BITS_TIME_H 1 2025-05-07T19:48:32.9636060Z #define _BITS_TYPESIZES_H 1 2025-05-07T19:48:32.9636358Z #define _BITS_TYPES_H 1 2025-05-07T19:48:32.9636619Z #define _BSD_SOURCE 1 2025-05-07T19:48:32.9636897Z #define _CONCEPT_CHECK_H 1 2025-05-07T19:48:32.9637178Z #define _CPP_TYPE_TRAITS_H 1 2025-05-07T19:48:32.9637478Z #define _CRTIMP 2025-05-07T19:48:32.9637718Z #define _ENDIAN_H 1 2025-05-07T19:48:32.9638000Z #define _EXCEPTION_DEFINES_H 1 2025-05-07T19:48:32.9638295Z #define _EXT_NUMERIC_TRAITS 1 2025-05-07T19:48:32.9638614Z #define _EXT_TYPE_TRAITS 1 2025-05-07T19:48:32.9638886Z #define _FEATURES_H 1 2025-05-07T19:48:32.9639173Z #define _FUNCTEXCEPT_H 1 2025-05-07T19:48:32.9639487Z #define _GCC_LIMITS_H_ 2025-05-07T19:48:32.9639798Z #define _GLIBCXX11_DEPRECATED _GLIBCXX_DEPRECATED 2025-05-07T19:48:32.9640330Z #define _GLIBCXX11_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:48:32.9640812Z #define _GLIBCXX11_USE_C99_COMPLEX 1 2025-05-07T19:48:32.9641152Z #define _GLIBCXX11_USE_C99_MATH 1 2025-05-07T19:48:32.9641465Z #define _GLIBCXX11_USE_C99_STDIO 1 2025-05-07T19:48:32.9641808Z #define _GLIBCXX11_USE_C99_STDLIB 1 2025-05-07T19:48:32.9642122Z #define _GLIBCXX11_USE_C99_WCHAR 1 2025-05-07T19:48:32.9642468Z #define _GLIBCXX14_CONSTEXPR constexpr 2025-05-07T19:48:32.9642831Z #define _GLIBCXX17_CONSTEXPR constexpr 2025-05-07T19:48:32.9643202Z #define _GLIBCXX17_DEPRECATED [[__deprecated__]] 2025-05-07T19:48:32.9643731Z #define _GLIBCXX17_DEPRECATED_SUGGEST(ALT) _GLIBCXX_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:48:32.9644198Z #define _GLIBCXX17_INLINE inline 2025-05-07T19:48:32.9644534Z #define _GLIBCXX20_CONSTEXPR 2025-05-07T19:48:32.9644849Z #define _GLIBCXX20_DEPRECATED(MSG) 2025-05-07T19:48:32.9645323Z #define _GLIBCXX20_DEPRECATED_SUGGEST(ALT) 2025-05-07T19:48:32.9645652Z #define _GLIBCXX98_USE_C99_COMPLEX 1 2025-05-07T19:48:32.9645975Z #define _GLIBCXX98_USE_C99_MATH 1 2025-05-07T19:48:32.9646265Z #define _GLIBCXX98_USE_C99_STDIO 1 2025-05-07T19:48:32.9646589Z #define _GLIBCXX98_USE_C99_STDLIB 1 2025-05-07T19:48:32.9646913Z #define _GLIBCXX98_USE_C99_WCHAR 1 2025-05-07T19:48:32.9647296Z #define _GLIBCXX_ABI_TAG_CXX11 __attribute ((__abi_tag__ ("cxx11"))) 2025-05-07T19:48:32.9647730Z #define _GLIBCXX_ATOMIC_BUILTINS 1 2025-05-07T19:48:32.9648052Z #define _GLIBCXX_BEGIN_EXTERN_C extern "C" { 2025-05-07T19:48:32.9648611Z #define _GLIBCXX_BEGIN_NAMESPACE_ALGO 2025-05-07T19:48:32.9649020Z #define _GLIBCXX_BEGIN_NAMESPACE_CONTAINER 2025-05-07T19:48:32.9649447Z #define _GLIBCXX_BEGIN_NAMESPACE_CXX11 namespace __cxx11 { 2025-05-07T19:48:32.9649835Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL 2025-05-07T19:48:32.9650307Z #define _GLIBCXX_BEGIN_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_BEGIN_NAMESPACE_CXX11 2025-05-07T19:48:32.9650801Z #define _GLIBCXX_BEGIN_NAMESPACE_VERSION 2025-05-07T19:48:32.9651131Z #define _GLIBCXX_BITS_SPECFUN_H 1 2025-05-07T19:48:32.9651467Z #define _GLIBCXX_BITS_STD_ABS_H 2025-05-07T19:48:32.9651756Z #define _GLIBCXX_CMATH 1 2025-05-07T19:48:32.9652085Z #define _GLIBCXX_CONST __attribute__ ((__const__)) 2025-05-07T19:48:32.9652446Z #define _GLIBCXX_CONSTEXPR constexpr 2025-05-07T19:48:32.9652797Z #define _GLIBCXX_CPU_DEFINES 1 2025-05-07T19:48:32.9653076Z #define _GLIBCXX_CSTDLIB 1 2025-05-07T19:48:32.9653360Z #define _GLIBCXX_CXX_CONFIG_H 1 2025-05-07T19:48:32.9653675Z #define _GLIBCXX_DARWIN_USE_64_BIT_INODE 1 2025-05-07T19:48:32.9654004Z #define _GLIBCXX_DEBUG_ASSERT(_Condition) 2025-05-07T19:48:32.9654355Z #define _GLIBCXX_DEBUG_ASSERTIONS_H 1 2025-05-07T19:48:32.9654663Z #define _GLIBCXX_DEBUG_MACRO_SWITCH_H 1 2025-05-07T19:48:32.9654997Z #define _GLIBCXX_DEBUG_ONLY(_Statement) 2025-05-07T19:48:32.9655330Z #define _GLIBCXX_DEBUG_PEDASSERT(_Condition) 2025-05-07T19:48:32.9655730Z #define _GLIBCXX_DEFAULT_ABI_TAG _GLIBCXX_ABI_TAG_CXX11 2025-05-07T19:48:32.9656148Z #define _GLIBCXX_DEPRECATED __attribute__ ((__deprecated__)) 2025-05-07T19:48:32.9656739Z #define _GLIBCXX_DEPRECATED_SUGGEST(ALT) __attribute__ ((__deprecated__ ("use '" ALT "' instead"))) 2025-05-07T19:48:32.9657289Z #define _GLIBCXX_DOUBLE_IS_IEEE_BINARY64 1 2025-05-07T19:48:32.9657609Z #define _GLIBCXX_END_EXTERN_C } 2025-05-07T19:48:32.9657930Z #define _GLIBCXX_END_NAMESPACE_ALGO 2025-05-07T19:48:32.9658251Z #define _GLIBCXX_END_NAMESPACE_CONTAINER 2025-05-07T19:48:32.9658604Z #define _GLIBCXX_END_NAMESPACE_CXX11 } 2025-05-07T19:48:32.9658928Z #define _GLIBCXX_END_NAMESPACE_LDBL 2025-05-07T19:48:32.9659373Z #define _GLIBCXX_END_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_END_NAMESPACE_CXX11 2025-05-07T19:48:32.9659807Z #define _GLIBCXX_END_NAMESPACE_VERSION 2025-05-07T19:48:32.9660149Z #define _GLIBCXX_EXTERN_TEMPLATE 1 2025-05-07T19:48:32.9660471Z #define _GLIBCXX_FAST_MATH 0 2025-05-07T19:48:32.9660758Z #define _GLIBCXX_FLOAT_IS_IEEE_BINARY32 1 2025-05-07T19:48:32.9661188Z #define _GLIBCXX_FORWARD(_Tp,__val) std::forward<_Tp>(__val) 2025-05-07T19:48:32.9661572Z #define _GLIBCXX_FULLY_DYNAMIC_STRING 0 2025-05-07T19:48:32.9661914Z #define _GLIBCXX_FWDREF(_Tp) _Tp&& 2025-05-07T19:48:32.9662210Z #define _GLIBCXX_HAS_GTHREADS 1 2025-05-07T19:48:32.9663116Z #define _GLIBCXX_HAS_NESTED_TYPE(_NTYPE) template> struct __has_##_NTYPE : false_type { }; template struct __has_##_NTYPE<_Tp, __void_t> : true_type { }; 2025-05-07T19:48:32.9664193Z #define _GLIBCXX_HAVE_ACOSF 1 2025-05-07T19:48:32.9664480Z #define _GLIBCXX_HAVE_ACOSL 1 2025-05-07T19:48:32.9664802Z #define _GLIBCXX_HAVE_ALIGNED_ALLOC 1 2025-05-07T19:48:32.9665112Z #define _GLIBCXX_HAVE_ARPA_INET_H 1 2025-05-07T19:48:32.9665438Z #define _GLIBCXX_HAVE_ASINF 1 2025-05-07T19:48:32.9665710Z #define _GLIBCXX_HAVE_ASINL 1 2025-05-07T19:48:32.9666025Z #define _GLIBCXX_HAVE_AS_SYMVER_DIRECTIVE 1 2025-05-07T19:48:32.9666367Z #define _GLIBCXX_HAVE_ATAN2F 1 2025-05-07T19:48:32.9666640Z #define _GLIBCXX_HAVE_ATAN2L 1 2025-05-07T19:48:32.9666933Z #define _GLIBCXX_HAVE_ATANF 1 2025-05-07T19:48:32.9667203Z #define _GLIBCXX_HAVE_ATANL 1 2025-05-07T19:48:32.9667522Z #define _GLIBCXX_HAVE_ATOMIC_LOCK_POLICY 1 2025-05-07T19:48:32.9667858Z #define _GLIBCXX_HAVE_ATTRIBUTE_VISIBILITY 1 2025-05-07T19:48:32.9668211Z #define _GLIBCXX_HAVE_AT_QUICK_EXIT 1 2025-05-07T19:48:32.9668544Z #define _GLIBCXX_HAVE_BUILTIN_HAS_UNIQ_OBJ_REP 1 2025-05-07T19:48:32.9668925Z #define _GLIBCXX_HAVE_BUILTIN_IS_AGGREGATE 1 2025-05-07T19:48:32.9669409Z #define _GLIBCXX_HAVE_BUILTIN_IS_CONSTANT_EVALUATED 1 2025-05-07T19:48:32.9669834Z #define _GLIBCXX_HAVE_BUILTIN_IS_SAME 1 2025-05-07T19:48:32.9670178Z #define _GLIBCXX_HAVE_BUILTIN_LAUNDER 1 2025-05-07T19:48:32.9670485Z #define _GLIBCXX_HAVE_CEILF 1 2025-05-07T19:48:32.9670790Z #define _GLIBCXX_HAVE_CEILL 1 2025-05-07T19:48:32.9671073Z #define _GLIBCXX_HAVE_COMPLEX_H 1 2025-05-07T19:48:32.9671383Z #define _GLIBCXX_HAVE_COSF 1 2025-05-07T19:48:32.9671650Z #define _GLIBCXX_HAVE_COSHF 1 2025-05-07T19:48:32.9671941Z #define _GLIBCXX_HAVE_COSHL 1 2025-05-07T19:48:32.9672211Z #define _GLIBCXX_HAVE_COSL 1 2025-05-07T19:48:32.9672623Z #define _GLIBCXX_HAVE_DIRENT_H 1 2025-05-07T19:48:32.9673131Z #define _GLIBCXX_HAVE_DLFCN_H 1 2025-05-07T19:48:32.9673488Z #define _GLIBCXX_HAVE_ENDIAN_H 1 2025-05-07T19:48:32.9673852Z #define _GLIBCXX_HAVE_EXCEPTION_PTR_SINCE_GCC46 1 2025-05-07T19:48:32.9674218Z #define _GLIBCXX_HAVE_EXECINFO_H 1 2025-05-07T19:48:32.9674565Z #define _GLIBCXX_HAVE_EXPF 1 2025-05-07T19:48:32.9674853Z #define _GLIBCXX_HAVE_EXPL 1 2025-05-07T19:48:32.9675172Z #define _GLIBCXX_HAVE_FABSF 1 2025-05-07T19:48:32.9675466Z #define _GLIBCXX_HAVE_FABSL 1 2025-05-07T19:48:32.9675788Z #define _GLIBCXX_HAVE_FCNTL_H 1 2025-05-07T19:48:32.9676092Z #define _GLIBCXX_HAVE_FENV_H 1 2025-05-07T19:48:32.9676420Z #define _GLIBCXX_HAVE_FINITE 1 2025-05-07T19:48:32.9676751Z #define _GLIBCXX_HAVE_FINITEF 1 2025-05-07T19:48:32.9677051Z #define _GLIBCXX_HAVE_FINITEL 1 2025-05-07T19:48:32.9677368Z #define _GLIBCXX_HAVE_FLOAT_H 1 2025-05-07T19:48:32.9677661Z #define _GLIBCXX_HAVE_FLOORF 1 2025-05-07T19:48:32.9677985Z #define _GLIBCXX_HAVE_FLOORL 1 2025-05-07T19:48:32.9678271Z #define _GLIBCXX_HAVE_FMODF 1 2025-05-07T19:48:32.9678579Z #define _GLIBCXX_HAVE_FMODL 1 2025-05-07T19:48:32.9678873Z #define _GLIBCXX_HAVE_FREXPF 1 2025-05-07T19:48:32.9679189Z #define _GLIBCXX_HAVE_FREXPL 1 2025-05-07T19:48:32.9679491Z #define _GLIBCXX_HAVE_GETIPINFO 1 2025-05-07T19:48:32.9679812Z #define _GLIBCXX_HAVE_GETS 1 2025-05-07T19:48:32.9680107Z #define _GLIBCXX_HAVE_HYPOT 1 2025-05-07T19:48:32.9680400Z #define _GLIBCXX_HAVE_HYPOTF 1 2025-05-07T19:48:32.9680710Z #define _GLIBCXX_HAVE_HYPOTL 1 2025-05-07T19:48:32.9681012Z #define _GLIBCXX_HAVE_ICONV 1 2025-05-07T19:48:32.9681340Z #define _GLIBCXX_HAVE_INT64_T 1 2025-05-07T19:48:32.9681643Z #define _GLIBCXX_HAVE_INT64_T_LONG 1 2025-05-07T19:48:32.9681999Z #define _GLIBCXX_HAVE_INTTYPES_H 1 2025-05-07T19:48:32.9682311Z #define _GLIBCXX_HAVE_ISINF 1 2025-05-07T19:48:32.9682643Z #define _GLIBCXX_HAVE_ISINFF 1 2025-05-07T19:48:32.9682945Z #define _GLIBCXX_HAVE_ISINFL 1 2025-05-07T19:48:32.9683275Z #define _GLIBCXX_HAVE_ISNAN 1 2025-05-07T19:48:32.9683607Z #define _GLIBCXX_HAVE_ISNANF 1 2025-05-07T19:48:32.9683908Z #define _GLIBCXX_HAVE_ISNANL 1 2025-05-07T19:48:32.9684245Z #define _GLIBCXX_HAVE_ISWBLANK 1 2025-05-07T19:48:32.9684563Z #define _GLIBCXX_HAVE_LC_MESSAGES 1 2025-05-07T19:48:32.9684919Z #define _GLIBCXX_HAVE_LDEXPF 1 2025-05-07T19:48:32.9685340Z #define _GLIBCXX_HAVE_LDEXPL 1 2025-05-07T19:48:32.9685656Z #define _GLIBCXX_HAVE_LIMIT_AS 1 2025-05-07T19:48:32.9685960Z #define _GLIBCXX_HAVE_LIMIT_DATA 1 2025-05-07T19:48:32.9686447Z #define _GLIBCXX_HAVE_LIMIT_FSIZE 1 2025-05-07T19:48:32.9686749Z #define _GLIBCXX_HAVE_LIMIT_RSS 1 2025-05-07T19:48:32.9687074Z #define _GLIBCXX_HAVE_LIMIT_VMEM 0 2025-05-07T19:48:32.9687401Z #define _GLIBCXX_HAVE_LINK 1 2025-05-07T19:48:32.9687679Z #define _GLIBCXX_HAVE_LINUX_FUTEX 1 2025-05-07T19:48:32.9687998Z #define _GLIBCXX_HAVE_LINUX_RANDOM_H 1 2025-05-07T19:48:32.9688304Z #define _GLIBCXX_HAVE_LINUX_TYPES_H 1 2025-05-07T19:48:32.9688631Z #define _GLIBCXX_HAVE_LOCALE_H 1 2025-05-07T19:48:32.9688909Z #define _GLIBCXX_HAVE_LOG10F 1 2025-05-07T19:48:32.9689204Z #define _GLIBCXX_HAVE_LOG10L 1 2025-05-07T19:48:32.9689474Z #define _GLIBCXX_HAVE_LOGF 1 2025-05-07T19:48:32.9689768Z #define _GLIBCXX_HAVE_LOGL 1 2025-05-07T19:48:32.9690044Z #define _GLIBCXX_HAVE_MBSTATE_T 1 2025-05-07T19:48:32.9690349Z #define _GLIBCXX_HAVE_MEMALIGN 1 2025-05-07T19:48:32.9690658Z #define _GLIBCXX_HAVE_MEMORY_H 1 2025-05-07T19:48:32.9691036Z #define _GLIBCXX_HAVE_MODF 1 2025-05-07T19:48:32.9691398Z #define _GLIBCXX_HAVE_MODFF 1 2025-05-07T19:48:32.9691669Z #define _GLIBCXX_HAVE_MODFL 1 2025-05-07T19:48:32.9691965Z #define _GLIBCXX_HAVE_NETDB_H 1 2025-05-07T19:48:32.9692258Z #define _GLIBCXX_HAVE_NETINET_IN_H 1 2025-05-07T19:48:32.9692590Z #define _GLIBCXX_HAVE_NETINET_TCP_H 1 2025-05-07T19:48:32.9692902Z #define _GLIBCXX_HAVE_OBSOLETE_ISINF 1 2025-05-07T19:48:32.9693240Z #define _GLIBCXX_HAVE_OBSOLETE_ISNAN 1 2025-05-07T19:48:32.9693542Z #define _GLIBCXX_HAVE_POLL 1 2025-05-07T19:48:32.9693841Z #define _GLIBCXX_HAVE_POLL_H 1 2025-05-07T19:48:32.9694148Z #define _GLIBCXX_HAVE_POSIX_MEMALIGN 1 2025-05-07T19:48:32.9694461Z #define _GLIBCXX_HAVE_POSIX_SEMAPHORE 1 2025-05-07T19:48:32.9694795Z #define _GLIBCXX_HAVE_POWF 1 2025-05-07T19:48:32.9695067Z #define _GLIBCXX_HAVE_POWL 1 2025-05-07T19:48:32.9695364Z #define _GLIBCXX_HAVE_QUICK_EXIT 1 2025-05-07T19:48:32.9695657Z #define _GLIBCXX_HAVE_READLINK 1 2025-05-07T19:48:32.9695959Z #define _GLIBCXX_HAVE_SETENV 1 2025-05-07T19:48:32.9696237Z #define _GLIBCXX_HAVE_SINCOS 1 2025-05-07T19:48:32.9696540Z #define _GLIBCXX_HAVE_SINCOSF 1 2025-05-07T19:48:32.9696845Z #define _GLIBCXX_HAVE_SINCOSL 1 2025-05-07T19:48:32.9697118Z #define _GLIBCXX_HAVE_SINF 1 2025-05-07T19:48:32.9697412Z #define _GLIBCXX_HAVE_SINHF 1 2025-05-07T19:48:32.9697680Z #define _GLIBCXX_HAVE_SINHL 1 2025-05-07T19:48:32.9697972Z #define _GLIBCXX_HAVE_SINL 1 2025-05-07T19:48:32.9698245Z #define _GLIBCXX_HAVE_SOCKATMARK 1 2025-05-07T19:48:32.9698569Z #define _GLIBCXX_HAVE_SQRTF 1 2025-05-07T19:48:32.9698838Z #define _GLIBCXX_HAVE_SQRTL 1 2025-05-07T19:48:32.9699139Z #define _GLIBCXX_HAVE_STDALIGN_H 1 2025-05-07T19:48:32.9699429Z #define _GLIBCXX_HAVE_STDBOOL_H 1 2025-05-07T19:48:32.9699740Z #define _GLIBCXX_HAVE_STDINT_H 1 2025-05-07T19:48:32.9700048Z #define _GLIBCXX_HAVE_STDLIB_H 1 2025-05-07T19:48:32.9700331Z #define _GLIBCXX_HAVE_STRERROR_L 1 2025-05-07T19:48:32.9700645Z #define _GLIBCXX_HAVE_STRERROR_R 1 2025-05-07T19:48:32.9700936Z #define _GLIBCXX_HAVE_STRINGS_H 1 2025-05-07T19:48:32.9701250Z #define _GLIBCXX_HAVE_STRING_H 1 2025-05-07T19:48:32.9701531Z #define _GLIBCXX_HAVE_STRTOF 1 2025-05-07T19:48:32.9701837Z #define _GLIBCXX_HAVE_STRTOLD 1 2025-05-07T19:48:32.9702134Z #define _GLIBCXX_HAVE_STRUCT_DIRENT_D_TYPE 1 2025-05-07T19:48:32.9702481Z #define _GLIBCXX_HAVE_STRXFRM_L 1 2025-05-07T19:48:32.9702764Z #define _GLIBCXX_HAVE_SYMLINK 1 2025-05-07T19:48:32.9703137Z #define _GLIBCXX_HAVE_SYMVER_SYMBOL_RENAMING_RUNTIME_SUPPORT 1 2025-05-07T19:48:32.9703553Z #define _GLIBCXX_HAVE_SYS_IOCTL_H 1 2025-05-07T19:48:32.9703844Z #define _GLIBCXX_HAVE_SYS_IPC_H 1 2025-05-07T19:48:32.9704159Z #define _GLIBCXX_HAVE_SYS_PARAM_H 1 2025-05-07T19:48:32.9704460Z #define _GLIBCXX_HAVE_SYS_RESOURCE_H 1 2025-05-07T19:48:32.9704783Z #define _GLIBCXX_HAVE_SYS_SEM_H 1 2025-05-07T19:48:32.9705074Z #define _GLIBCXX_HAVE_SYS_SOCKET_H 1 2025-05-07T19:48:32.9705391Z #define _GLIBCXX_HAVE_SYS_STATVFS_H 1 2025-05-07T19:48:32.9705691Z #define _GLIBCXX_HAVE_SYS_STAT_H 1 2025-05-07T19:48:32.9706013Z #define _GLIBCXX_HAVE_SYS_SYSINFO_H 1 2025-05-07T19:48:32.9706321Z #define _GLIBCXX_HAVE_SYS_TIME_H 1 2025-05-07T19:48:32.9706646Z #define _GLIBCXX_HAVE_SYS_TYPES_H 1 2025-05-07T19:48:32.9706976Z #define _GLIBCXX_HAVE_SYS_UIO_H 1 2025-05-07T19:48:32.9707266Z #define _GLIBCXX_HAVE_S_ISREG 1 2025-05-07T19:48:32.9707577Z #define _GLIBCXX_HAVE_TANF 1 2025-05-07T19:48:32.9707859Z #define _GLIBCXX_HAVE_TANHF 1 2025-05-07T19:48:32.9708164Z #define _GLIBCXX_HAVE_TANHL 1 2025-05-07T19:48:32.9708437Z #define _GLIBCXX_HAVE_TANL 1 2025-05-07T19:48:32.9708737Z #define _GLIBCXX_HAVE_TGMATH_H 1 2025-05-07T19:48:32.9708999Z #define _GLIBCXX_HAVE_TLS 1 2025-05-07T19:48:32.9709265Z #define _GLIBCXX_HAVE_TRUNCATE 1 2025-05-07T19:48:32.9709527Z #define _GLIBCXX_HAVE_UNISTD_H 1 2025-05-07T19:48:32.9709812Z #define _GLIBCXX_HAVE_USELOCALE 1 2025-05-07T19:48:32.9710107Z #define _GLIBCXX_HAVE_UTIME_H 1 2025-05-07T19:48:32.9710380Z #define _GLIBCXX_HAVE_VFWSCANF 1 2025-05-07T19:48:32.9710753Z #define _GLIBCXX_HAVE_VSWSCANF 1 2025-05-07T19:48:32.9711085Z #define _GLIBCXX_HAVE_VWSCANF 1 2025-05-07T19:48:32.9711363Z #define _GLIBCXX_HAVE_WCHAR_H 1 2025-05-07T19:48:32.9711630Z #define _GLIBCXX_HAVE_WCSTOF 1 2025-05-07T19:48:32.9711913Z #define _GLIBCXX_HAVE_WCTYPE_H 1 2025-05-07T19:48:32.9712192Z #define _GLIBCXX_HAVE_WRITEV 1 2025-05-07T19:48:32.9712604Z #define _GLIBCXX_HAVE_XLOCALE_H 1 2025-05-07T19:48:32.9713069Z #define _GLIBCXX_HOSTED 1 2025-05-07T19:48:32.9713358Z #define _GLIBCXX_ICONV_CONST 2025-05-07T19:48:32.9713659Z #define _GLIBCXX_INLINE_VERSION 0 2025-05-07T19:48:32.9713957Z #define _GLIBCXX_LT_OBJDIR ".libs/" 2025-05-07T19:48:32.9714518Z #define _GLIBCXX_MAKE_MOVE_IF_NOEXCEPT_ITERATOR(_Iter) std::__make_move_if_noexcept_iterator(_Iter) 2025-05-07T19:48:32.9715203Z #define _GLIBCXX_MAKE_MOVE_ITERATOR(_Iter) std::make_move_iterator(_Iter) 2025-05-07T19:48:32.9715701Z #define _GLIBCXX_MANGLE_SIZE_T m 2025-05-07T19:48:32.9716014Z #define _GLIBCXX_MATH_H 1 2025-05-07T19:48:32.9716367Z #define _GLIBCXX_MOVE(__val) std::move(__val) 2025-05-07T19:48:32.9716798Z #define _GLIBCXX_MOVE3(_Tp,_Up,_Vp) std::move(_Tp, _Up, _Vp) 2025-05-07T19:48:32.9717368Z #define _GLIBCXX_MOVE_BACKWARD3(_Tp,_Up,_Vp) std::move_backward(_Tp, _Up, _Vp) 2025-05-07T19:48:32.9717886Z #define _GLIBCXX_NAMESPACE_CXX11 __cxx11:: 2025-05-07T19:48:32.9718235Z #define _GLIBCXX_NAMESPACE_LDBL 2025-05-07T19:48:32.9718673Z #define _GLIBCXX_NAMESPACE_LDBL_OR_CXX11 _GLIBCXX_NAMESPACE_CXX11 2025-05-07T19:48:32.9719276Z #define _GLIBCXX_NATIVE_THREAD_ID (__gthread_active_p() ? __gthread_self() : (__gthread_t)1) 2025-05-07T19:48:32.9719852Z #define _GLIBCXX_NODISCARD [[__nodiscard__]] 2025-05-07T19:48:32.9720213Z #define _GLIBCXX_NOEXCEPT noexcept 2025-05-07T19:48:32.9720614Z #define _GLIBCXX_NOEXCEPT_IF(...) noexcept(__VA_ARGS__) 2025-05-07T19:48:32.9721004Z #define _GLIBCXX_NOEXCEPT_PARM , bool _NE 2025-05-07T19:48:32.9721349Z #define _GLIBCXX_NOEXCEPT_QUAL noexcept (_NE) 2025-05-07T19:48:32.9721749Z #define _GLIBCXX_NORETURN __attribute__ ((__noreturn__)) 2025-05-07T19:48:32.9722142Z #define _GLIBCXX_NOTHROW _GLIBCXX_USE_NOEXCEPT 2025-05-07T19:48:32.9722599Z #define _GLIBCXX_NO_OBSOLETE_ISINF_ISNAN_DYNAMIC __GLIBC_PREREQ(2,23) 2025-05-07T19:48:32.9723021Z #define _GLIBCXX_NUMERIC_LIMITS 1 2025-05-07T19:48:32.9723329Z #define _GLIBCXX_OS_DEFINES 1 2025-05-07T19:48:32.9745965Z #define _GLIBCXX_PACKAGE_BUGREPORT "" 2025-05-07T19:48:32.9746476Z #define _GLIBCXX_PACKAGE_NAME "package-unused" 2025-05-07T19:48:32.9746920Z #define _GLIBCXX_PACKAGE_STRING "package-unused version-unused" 2025-05-07T19:48:32.9747379Z #define _GLIBCXX_PACKAGE_TARNAME "libstdc++" 2025-05-07T19:48:32.9747721Z #define _GLIBCXX_PACKAGE_URL "" 2025-05-07T19:48:32.9748102Z #define _GLIBCXX_PACKAGE__GLIBCXX_VERSION "version-unused" 2025-05-07T19:48:32.9748510Z #define _GLIBCXX_PREDEFINED_OPS_H 1 2025-05-07T19:48:32.9748829Z #define _GLIBCXX_PSEUDO_VISIBILITY(V) 2025-05-07T19:48:32.9749197Z #define _GLIBCXX_PURE __attribute__ ((__pure__)) 2025-05-07T19:48:32.9749544Z #define _GLIBCXX_RELEASE 11 2025-05-07T19:48:32.9749865Z #define _GLIBCXX_RES_LIMITS 1 2025-05-07T19:48:32.9750157Z #define _GLIBCXX_STDC_HEADERS 1 2025-05-07T19:48:32.9750469Z #define _GLIBCXX_STDIO_EOF -1 2025-05-07T19:48:32.9750751Z #define _GLIBCXX_STDIO_SEEK_CUR 1 2025-05-07T19:48:32.9751079Z #define _GLIBCXX_STDIO_SEEK_END 2 2025-05-07T19:48:32.9751371Z #define _GLIBCXX_STDLIB_H 1 2025-05-07T19:48:32.9751664Z #define _GLIBCXX_STD_A std 2025-05-07T19:48:32.9751955Z #define _GLIBCXX_STD_C std 2025-05-07T19:48:32.9752217Z #define _GLIBCXX_SYMVER 1 2025-05-07T19:48:32.9752641Z #define _GLIBCXX_SYMVER_GNU 1 2025-05-07T19:48:32.9753151Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_AFTER(A) 2025-05-07T19:48:32.9753603Z #define _GLIBCXX_SYNCHRONIZATION_HAPPENS_BEFORE(A) 2025-05-07T19:48:32.9753983Z #define _GLIBCXX_THROW(_EXC) 2025-05-07T19:48:32.9754349Z #define _GLIBCXX_THROW_OR_ABORT(_EXC) (throw (_EXC)) 2025-05-07T19:48:32.9754738Z #define _GLIBCXX_TR1_BESSEL_FUNCTION_TCC 1 2025-05-07T19:48:32.9755318Z #define _GLIBCXX_TR1_BETA_FUNCTION_TCC 1 2025-05-07T19:48:32.9755769Z #define _GLIBCXX_TR1_ELL_INTEGRAL_TCC 1 2025-05-07T19:48:32.9756104Z #define _GLIBCXX_TR1_EXP_INTEGRAL_TCC 1 2025-05-07T19:48:32.9756458Z #define _GLIBCXX_TR1_GAMMA_TCC 1 2025-05-07T19:48:32.9756783Z #define _GLIBCXX_TR1_HYPERGEOMETRIC_TCC 1 2025-05-07T19:48:32.9757166Z #define _GLIBCXX_TR1_LEGENDRE_FUNCTION_TCC 1 2025-05-07T19:48:32.9757540Z #define _GLIBCXX_TR1_MODIFIED_BESSEL_FUNC_TCC 1 2025-05-07T19:48:32.9757927Z #define _GLIBCXX_TR1_POLY_HERMITE_TCC 1 2025-05-07T19:48:32.9758267Z #define _GLIBCXX_TR1_POLY_LAGUERRE_TCC 1 2025-05-07T19:48:32.9758634Z #define _GLIBCXX_TR1_RIEMANN_ZETA_TCC 1 2025-05-07T19:48:32.9758986Z #define _GLIBCXX_TR1_SPECIAL_FUNCTION_UTIL_H 1 2025-05-07T19:48:32.9759362Z #define _GLIBCXX_TXN_SAFE 2025-05-07T19:48:32.9759673Z #define _GLIBCXX_TXN_SAFE_DYN 2025-05-07T19:48:32.9759971Z #define _GLIBCXX_TYPE_TRAITS 1 2025-05-07T19:48:32.9760304Z #define _GLIBCXX_USE_ALLOCATOR_NEW 1 2025-05-07T19:48:32.9760622Z #define _GLIBCXX_USE_C99 1 2025-05-07T19:48:32.9761002Z #define _GLIBCXX_USE_C99_COMPLEX _GLIBCXX11_USE_C99_COMPLEX 2025-05-07T19:48:32.9761414Z #define _GLIBCXX_USE_C99_COMPLEX_TR1 1 2025-05-07T19:48:32.9761772Z #define _GLIBCXX_USE_C99_CTYPE_TR1 1 2025-05-07T19:48:32.9762091Z #define _GLIBCXX_USE_C99_FENV_TR1 1 2025-05-07T19:48:32.9762444Z #define _GLIBCXX_USE_C99_INTTYPES_TR1 1 2025-05-07T19:48:32.9762830Z #define _GLIBCXX_USE_C99_INTTYPES_WCHAR_T_TR1 1 2025-05-07T19:48:32.9763233Z #define _GLIBCXX_USE_C99_MATH _GLIBCXX11_USE_C99_MATH 2025-05-07T19:48:32.9763628Z #define _GLIBCXX_USE_C99_MATH_TR1 1 2025-05-07T19:48:32.9763946Z #define _GLIBCXX_USE_C99_STDINT_TR1 1 2025-05-07T19:48:32.9764337Z #define _GLIBCXX_USE_C99_STDIO _GLIBCXX11_USE_C99_STDIO 2025-05-07T19:48:32.9764773Z #define _GLIBCXX_USE_C99_STDLIB _GLIBCXX11_USE_C99_STDLIB 2025-05-07T19:48:32.9765350Z #define _GLIBCXX_USE_C99_WCHAR _GLIBCXX11_USE_C99_WCHAR 2025-05-07T19:48:32.9765709Z #define _GLIBCXX_USE_CLOCK_MONOTONIC 1 2025-05-07T19:48:32.9766021Z #define _GLIBCXX_USE_CLOCK_REALTIME 1 2025-05-07T19:48:32.9766351Z #define _GLIBCXX_USE_CONSTEXPR constexpr 2025-05-07T19:48:32.9766698Z #define _GLIBCXX_USE_CXX11_ABI 1 2025-05-07T19:48:32.9766996Z #define _GLIBCXX_USE_DECIMAL_FLOAT 1 2025-05-07T19:48:32.9767329Z #define _GLIBCXX_USE_DEPRECATED 1 2025-05-07T19:48:32.9767629Z #define _GLIBCXX_USE_DEV_RANDOM 1 2025-05-07T19:48:32.9767943Z #define _GLIBCXX_USE_DUAL_ABI 1 2025-05-07T19:48:32.9768241Z #define _GLIBCXX_USE_FCHMOD 1 2025-05-07T19:48:32.9768514Z #define _GLIBCXX_USE_FCHMODAT 1 2025-05-07T19:48:32.9768812Z #define _GLIBCXX_USE_FLOAT128 1 2025-05-07T19:48:32.9769097Z #define _GLIBCXX_USE_GETTIMEOFDAY 1 2025-05-07T19:48:32.9769417Z #define _GLIBCXX_USE_GET_NPROCS 1 2025-05-07T19:48:32.9769700Z #define _GLIBCXX_USE_INT128 1 2025-05-07T19:48:32.9769990Z #define _GLIBCXX_USE_LFS 1 2025-05-07T19:48:32.9770258Z #define _GLIBCXX_USE_LONG_LONG 1 2025-05-07T19:48:32.9770566Z #define _GLIBCXX_USE_LSTAT 1 2025-05-07T19:48:32.9770837Z #define _GLIBCXX_USE_NANOSLEEP 1 2025-05-07T19:48:32.9771160Z #define _GLIBCXX_USE_NOEXCEPT noexcept 2025-05-07T19:48:32.9771499Z #define _GLIBCXX_USE_PTHREAD_RWLOCK_T 1 2025-05-07T19:48:32.9771807Z #define _GLIBCXX_USE_RANDOM_TR1 1 2025-05-07T19:48:32.9772122Z #define _GLIBCXX_USE_REALPATH 1 2025-05-07T19:48:32.9772403Z #define _GLIBCXX_USE_SCHED_YIELD 1 2025-05-07T19:48:32.9772731Z #define _GLIBCXX_USE_SC_NPROCESSORS_ONLN 1 2025-05-07T19:48:32.9773044Z #define _GLIBCXX_USE_SENDFILE 1 2025-05-07T19:48:32.9773352Z #define _GLIBCXX_USE_STD_SPEC_FUNCS 1 2025-05-07T19:48:32.9773650Z #define _GLIBCXX_USE_ST_MTIM 1 2025-05-07T19:48:32.9774021Z #define _GLIBCXX_USE_TBB_PAR_BACKEND __has_include() 2025-05-07T19:48:32.9774406Z #define _GLIBCXX_USE_TMPNAM 1 2025-05-07T19:48:32.9774709Z #define _GLIBCXX_USE_UTIME 1 2025-05-07T19:48:32.9775006Z #define _GLIBCXX_USE_UTIMENSAT 1 2025-05-07T19:48:32.9775288Z #define _GLIBCXX_USE_WCHAR_T 1 2025-05-07T19:48:32.9775611Z #define _GLIBCXX_USE_WEAK_REF __GXX_WEAK__ 2025-05-07T19:48:32.9775922Z #define _GLIBCXX_UTILITY 1 2025-05-07T19:48:32.9776294Z #define _GLIBCXX_VERBOSE 1 2025-05-07T19:48:32.9776719Z #define _GLIBCXX_VISIBILITY(V) __attribute__ ((__visibility__ (#V))) 2025-05-07T19:48:32.9777153Z #define _GLIBCXX_WEAK_DEFINITION 2025-05-07T19:48:32.9777441Z #define _GLIBCXX_X86_RDRAND 1 2025-05-07T19:48:32.9777744Z #define _GLIBCXX_X86_RDSEED 1 2025-05-07T19:48:32.9778033Z #define _GNU_SOURCE 1 2025-05-07T19:48:32.9778288Z #define _GTHREAD_USE_MUTEX_TIMEDLOCK 1 2025-05-07T19:48:32.9778603Z #define _G_BUFSIZ 8192 2025-05-07T19:48:32.9778841Z #define _G_HAVE_MMAP 1 2025-05-07T19:48:32.9779103Z #define _G_HAVE_MREMAP 1 2025-05-07T19:48:32.9779408Z #define _G_HAVE_ST_BLKSIZE defined (_STATBUF_ST_BLKSIZE) 2025-05-07T19:48:32.9779788Z #define _G_IO_IO_FILE_VERSION 0x20001 2025-05-07T19:48:32.9780075Z #define _G_config_h 1 2025-05-07T19:48:32.9780343Z #define _G_va_list __gnuc_va_list 2025-05-07T19:48:32.9780622Z #define _INITIALIZER_LIST 2025-05-07T19:48:32.9780894Z #define _IOFBF 0 2025-05-07T19:48:32.9781117Z #define _IOLBF 1 2025-05-07T19:48:32.9781363Z #define _IONBF 2 2025-05-07T19:48:32.9781617Z #define _IOS_APPEND 8 2025-05-07T19:48:32.9781855Z #define _IOS_ATEND 4 2025-05-07T19:48:32.9782125Z #define _IOS_BIN 128 2025-05-07T19:48:32.9782370Z #define _IOS_INPUT 1 2025-05-07T19:48:32.9782643Z #define _IOS_NOCREATE 32 2025-05-07T19:48:32.9782903Z #define _IOS_NOREPLACE 64 2025-05-07T19:48:32.9783181Z #define _IOS_OUTPUT 2 2025-05-07T19:48:32.9783423Z #define _IOS_TRUNC 16 2025-05-07T19:48:32.9783688Z #define _IO_BAD_SEEN 0x4000 2025-05-07T19:48:32.9784006Z #define _IO_BE(expr,res) __builtin_expect ((expr), res) 2025-05-07T19:48:32.9784391Z #define _IO_BOOLALPHA 0200000 2025-05-07T19:48:32.9784693Z #define _IO_BUFSIZ _G_BUFSIZ 2025-05-07T19:48:32.9784972Z #define _IO_CURRENTLY_PUTTING 0x800 2025-05-07T19:48:32.9785279Z #define _IO_DEC 020 2025-05-07T19:48:32.9785522Z #define _IO_DELETE_DONT_CLOSE 0x40 2025-05-07T19:48:32.9785997Z #define _IO_DONT_CLOSE 0100000 2025-05-07T19:48:32.9786275Z #define _IO_EOF_SEEN 0x10 2025-05-07T19:48:32.9786565Z #define _IO_ERR_SEEN 0x20 2025-05-07T19:48:32.9786823Z #define _IO_FIXED 010000 2025-05-07T19:48:32.9787104Z #define _IO_FLAGS2_MMAP 1 2025-05-07T19:48:32.9787369Z #define _IO_FLAGS2_NOTCANCEL 2 2025-05-07T19:48:32.9787671Z #define _IO_FLAGS2_USER_WBUF 8 2025-05-07T19:48:32.9788001Z #define _IO_HAVE_ST_BLKSIZE _G_HAVE_ST_BLKSIZE 2025-05-07T19:48:32.9788310Z #define _IO_HEX 0100 2025-05-07T19:48:32.9788573Z #define _IO_INTERNAL 010 2025-05-07T19:48:32.9788823Z #define _IO_IN_BACKUP 0x100 2025-05-07T19:48:32.9789117Z #define _IO_IS_APPENDING 0x1000 2025-05-07T19:48:32.9789401Z #define _IO_IS_FILEBUF 0x2000 2025-05-07T19:48:32.9789690Z #define _IO_LEFT 02 2025-05-07T19:48:32.9789927Z #define _IO_LINE_BUF 0x200 2025-05-07T19:48:32.9790213Z #define _IO_LINKED 0x80 2025-05-07T19:48:32.9790465Z #define _IO_MAGIC 0xFBAD0000 2025-05-07T19:48:32.9790766Z #define _IO_MAGIC_MASK 0xFFFF0000 2025-05-07T19:48:32.9791047Z #define _IO_NO_READS 4 2025-05-07T19:48:32.9791320Z #define _IO_NO_WRITES 8 2025-05-07T19:48:32.9791590Z #define _IO_OCT 040 2025-05-07T19:48:32.9791966Z #define _IO_PENDING_OUTPUT_COUNT(_fp) ((_fp)->_IO_write_ptr - (_fp)->_IO_write_base) 2025-05-07T19:48:32.9792559Z #define _IO_RIGHT 04 2025-05-07T19:48:32.9792986Z #define _IO_SCIENTIFIC 04000 2025-05-07T19:48:32.9793299Z #define _IO_SHOWBASE 0200 2025-05-07T19:48:32.9793666Z #define _IO_SHOWPOINT 0400 2025-05-07T19:48:32.9793968Z #define _IO_SHOWPOS 02000 2025-05-07T19:48:32.9794233Z #define _IO_SKIPWS 01 2025-05-07T19:48:32.9794515Z #define _IO_STDIO 040000 2025-05-07T19:48:32.9794777Z #define _IO_STDIO_H 2025-05-07T19:48:32.9795059Z #define _IO_TIED_PUT_GET 0x400 2025-05-07T19:48:32.9795366Z #define _IO_UNBUFFERED 2 2025-05-07T19:48:32.9795638Z #define _IO_UNIFIED_JUMPTABLES 1 2025-05-07T19:48:32.9795961Z #define _IO_UNITBUF 020000 2025-05-07T19:48:32.9796242Z #define _IO_UPPERCASE 01000 2025-05-07T19:48:32.9796541Z #define _IO_USER_BUF 1 2025-05-07T19:48:32.9796806Z #define _IO_USER_LOCK 0x8000 2025-05-07T19:48:32.9797238Z #define _IO_cleanup_region_end(_Doit) 2025-05-07T19:48:32.9797648Z #define _IO_cleanup_region_start(_fct,_fp) 2025-05-07T19:48:32.9798118Z #define _IO_feof_unlocked(__fp) (((__fp)->_flags & _IO_EOF_SEEN) != 0) 2025-05-07T19:48:32.9798656Z #define _IO_ferror_unlocked(__fp) (((__fp)->_flags & _IO_ERR_SEEN) != 0) 2025-05-07T19:48:32.9799118Z #define _IO_file_flags _flags 2025-05-07T19:48:32.9799435Z #define _IO_flockfile(_fp) 2025-05-07T19:48:32.9799722Z #define _IO_fpos64_t _G_fpos64_t 2025-05-07T19:48:32.9800044Z #define _IO_fpos_t _G_fpos_t 2025-05-07T19:48:32.9800332Z #define _IO_ftrylockfile(_fp) 2025-05-07T19:48:32.9800650Z #define _IO_funlockfile(_fp) 2025-05-07T19:48:32.9801228Z #define _IO_getc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) ? __uflow (_fp) : *(unsigned char *) (_fp)->_IO_read_ptr++) 2025-05-07T19:48:32.9801862Z #define _IO_iconv_t _G_iconv_t 2025-05-07T19:48:32.9802150Z #define _IO_off64_t __off64_t 2025-05-07T19:48:32.9802453Z #define _IO_off_t __off_t 2025-05-07T19:48:32.9802788Z #define _IO_peekc(_fp) _IO_peekc_unlocked (_fp) 2025-05-07T19:48:32.9803469Z #define _IO_peekc_unlocked(_fp) (_IO_BE ((_fp)->_IO_read_ptr >= (_fp)->_IO_read_end, 0) && __underflow (_fp) == EOF ? EOF : *(unsigned char *) (_fp)->_IO_read_ptr) 2025-05-07T19:48:32.9804142Z #define _IO_pid_t __pid_t 2025-05-07T19:48:32.9804815Z #define _IO_putc_unlocked(_ch,_fp) (_IO_BE ((_fp)->_IO_write_ptr >= (_fp)->_IO_write_end, 0) ? __overflow (_fp, (unsigned char) (_ch)) : (unsigned char) (*(_fp)->_IO_write_ptr++ = (_ch))) 2025-05-07T19:48:32.9805628Z #define _IO_size_t size_t 2025-05-07T19:48:32.9805903Z #define _IO_ssize_t __ssize_t 2025-05-07T19:48:32.9806208Z #define _IO_stderr ((_IO_FILE*)(&_IO_2_1_stderr_)) 2025-05-07T19:48:32.9806588Z #define _IO_stdin ((_IO_FILE*)(&_IO_2_1_stdin_)) 2025-05-07T19:48:32.9806939Z #define _IO_stdout ((_IO_FILE*)(&_IO_2_1_stdout_)) 2025-05-07T19:48:32.9807291Z #define _IO_uid_t __uid_t 2025-05-07T19:48:32.9807555Z #define _IO_va_list __gnuc_va_list 2025-05-07T19:48:32.9807862Z #define _IO_wint_t wint_t 2025-05-07T19:48:32.9808113Z #define _ISOC11_SOURCE 1 2025-05-07T19:48:32.9808382Z #define _ISOC95_SOURCE 1 2025-05-07T19:48:32.9808622Z #define _ISOC99_SOURCE 1 2025-05-07T19:48:32.9808896Z #define _LARGEFILE64_SOURCE 1 2025-05-07T19:48:32.9809190Z #define _LARGEFILE_SOURCE 1 2025-05-07T19:48:32.9809449Z #define _LIBC_LIMITS_H_ 1 2025-05-07T19:48:32.9809721Z #define _LINUX_LIMITS_H 2025-05-07T19:48:32.9809965Z #define _LP64 1 2025-05-07T19:48:32.9810204Z #define _MATH_H 1 2025-05-07T19:48:32.9810431Z #define _MATH_H_MATHDEF 1 2025-05-07T19:48:32.9810696Z #define _MOVE_H 1 2025-05-07T19:48:32.9810917Z #define _Mfloat_ float 2025-05-07T19:48:32.9811184Z #define _Mlong_double_ long double 2025-05-07T19:48:32.9811460Z #define _NEW 2025-05-07T19:48:32.9811711Z #define _OLD_STDIO_MAGIC 0xFABC0000 2025-05-07T19:48:32.9812005Z #define _POSIX2_BC_BASE_MAX 99 2025-05-07T19:48:32.9812304Z #define _POSIX2_BC_DIM_MAX 2048 2025-05-07T19:48:32.9812601Z #define _POSIX2_BC_SCALE_MAX 99 2025-05-07T19:48:32.9812881Z #define _POSIX2_BC_STRING_MAX 1000 2025-05-07T19:48:32.9813209Z #define _POSIX2_CHARCLASS_NAME_MAX 14 2025-05-07T19:48:32.9813511Z #define _POSIX2_COLL_WEIGHTS_MAX 2 2025-05-07T19:48:32.9813825Z #define _POSIX2_EXPR_NEST_MAX 32 2025-05-07T19:48:32.9814105Z #define _POSIX2_LINE_MAX 2048 2025-05-07T19:48:32.9814417Z #define _POSIX2_RE_DUP_MAX 255 2025-05-07T19:48:32.9814702Z #define _POSIX_AIO_LISTIO_MAX 2 2025-05-07T19:48:32.9815006Z #define _POSIX_AIO_MAX 1 2025-05-07T19:48:32.9815262Z #define _POSIX_ARG_MAX 4096 2025-05-07T19:48:32.9815547Z #define _POSIX_CHILD_MAX 25 2025-05-07T19:48:32.9815837Z #define _POSIX_CLOCKRES_MIN 20000000 2025-05-07T19:48:32.9816131Z #define _POSIX_C_SOURCE 200809L 2025-05-07T19:48:32.9816442Z #define _POSIX_DELAYTIMER_MAX 32 2025-05-07T19:48:32.9816741Z #define _POSIX_FD_SETSIZE _POSIX_OPEN_MAX 2025-05-07T19:48:32.9817097Z #define _POSIX_HIWAT _POSIX_PIPE_BUF 2025-05-07T19:48:32.9817403Z #define _POSIX_HOST_NAME_MAX 255 2025-05-07T19:48:32.9817707Z #define _POSIX_LINK_MAX 8 2025-05-07T19:48:32.9818060Z #define _POSIX_LOGIN_NAME_MAX 9 2025-05-07T19:48:32.9818457Z #define _POSIX_MAX_CANON 255 2025-05-07T19:48:32.9818728Z #define _POSIX_MAX_INPUT 255 2025-05-07T19:48:32.9819024Z #define _POSIX_MQ_OPEN_MAX 8 2025-05-07T19:48:32.9819315Z #define _POSIX_MQ_PRIO_MAX 32 2025-05-07T19:48:32.9819586Z #define _POSIX_NAME_MAX 14 2025-05-07T19:48:32.9819872Z #define _POSIX_NGROUPS_MAX 8 2025-05-07T19:48:32.9820135Z #define _POSIX_OPEN_MAX 20 2025-05-07T19:48:32.9820419Z #define _POSIX_PATH_MAX 256 2025-05-07T19:48:32.9820683Z #define _POSIX_PIPE_BUF 512 2025-05-07T19:48:32.9820966Z #define _POSIX_QLIMIT 1 2025-05-07T19:48:32.9821220Z #define _POSIX_RE_DUP_MAX 255 2025-05-07T19:48:32.9821513Z #define _POSIX_RTSIG_MAX 8 2025-05-07T19:48:32.9821779Z #define _POSIX_SEM_NSEMS_MAX 256 2025-05-07T19:48:32.9822082Z #define _POSIX_SEM_VALUE_MAX 32767 2025-05-07T19:48:32.9822371Z #define _POSIX_SIGQUEUE_MAX 32 2025-05-07T19:48:32.9822664Z #define _POSIX_SOURCE 1 2025-05-07T19:48:32.9822940Z #define _POSIX_SSIZE_MAX 32767 2025-05-07T19:48:32.9823216Z #define _POSIX_STREAM_MAX 8 2025-05-07T19:48:32.9823510Z #define _POSIX_SYMLINK_MAX 255 2025-05-07T19:48:32.9823783Z #define _POSIX_SYMLOOP_MAX 8 2025-05-07T19:48:32.9824109Z #define _POSIX_THREAD_DESTRUCTOR_ITERATIONS 4 2025-05-07T19:48:32.9824439Z #define _POSIX_THREAD_KEYS_MAX 128 2025-05-07T19:48:32.9824758Z #define _POSIX_THREAD_THREADS_MAX 64 2025-05-07T19:48:32.9825049Z #define _POSIX_TIMER_MAX 32 2025-05-07T19:48:32.9825336Z #define _POSIX_TTY_NAME_MAX 9 2025-05-07T19:48:32.9825609Z #define _POSIX_TZNAME_MAX 6 2025-05-07T19:48:32.9825899Z #define _POSIX_UIO_MAXIOV 16 2025-05-07T19:48:32.9826262Z #define _PSTL_ASSERT(_Condition) __glibcxx_assert(_Condition) 2025-05-07T19:48:32.9826752Z #define _PSTL_ASSERT_MSG(_Condition,_Message) __glibcxx_assert(_Condition) 2025-05-07T19:48:32.9827377Z #define _PSTL_CLANG_VERSION (__clang_major__ * 10000 + __clang_minor__ * 100 + __clang_patchlevel__) 2025-05-07T19:48:32.9827862Z #define _PSTL_CONFIG_H 2025-05-07T19:48:32.9828346Z #define _PSTL_CPP11_STD_ROTATE_BROKEN ((__GLIBCXX__ && __GLIBCXX__ < 20150716) || (_MSC_VER && _MSC_VER < 1800)) 2025-05-07T19:48:32.9829187Z #define _PSTL_CPP14_2RANGE_MISMATCH_EQUAL_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201300L || __cpp_lib_robust_nonmodifying_seq_ops == 201304) 2025-05-07T19:48:32.9829995Z #define _PSTL_CPP14_INTEGER_SEQUENCE_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L) 2025-05-07T19:48:32.9830789Z #define _PSTL_CPP14_MAKE_REVERSE_ITERATOR_PRESENT (_MSC_VER >= 1900 || __cplusplus >= 201402L || __cpp_lib_make_reverse_iterator == 201402) 2025-05-07T19:48:32.9832156Z #define _PSTL_CPP14_VARIABLE_TEMPLATES_PRESENT (!__INTEL_COMPILER || __INTEL_COMPILER >= 1700) && (_MSC_FULL_VER >= 190023918 || __cplusplus >= 201402L) 2025-05-07T19:48:32.9833208Z #define _PSTL_CPP17_EXECUTION_POLICIES_PRESENT (_MSC_VER >= 1912) 2025-05-07T19:48:32.9833720Z #define _PSTL_EARLYEXIT_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:48:32.9834260Z #define _PSTL_GCC_VERSION (__GNUC__ * 10000 + __GNUC_MINOR__ * 100 + __GNUC_PATCHLEVEL__) 2025-05-07T19:48:32.9834772Z #define _PSTL_HIDE_FROM_ABI_POP 2025-05-07T19:48:32.9835080Z #define _PSTL_HIDE_FROM_ABI_PUSH 2025-05-07T19:48:32.9835483Z #define _PSTL_ICC_18_OMP_SIMD_BROKEN (__INTEL_COMPILER == 1800) 2025-05-07T19:48:32.9835954Z #define _PSTL_MONOTONIC_PRESENT (__INTEL_COMPILER >= 1800) 2025-05-07T19:48:32.9836369Z #define _PSTL_PAR_BACKEND_SERIAL 2025-05-07T19:48:32.9836680Z #define _PSTL_PRAGMA(x) _Pragma(# x) 2025-05-07T19:48:32.9837397Z #define _PSTL_PRAGMA_DECLARE_REDUCTION(NAME,OP) _PSTL_PRAGMA(omp declare reduction(NAME:OP : omp_out(omp_in)) initializer(omp_priv = omp_orig)) 2025-05-07T19:48:32.9838209Z #define _PSTL_PRAGMA_DECLARE_SIMD _PSTL_PRAGMA(omp declare simd) 2025-05-07T19:48:32.9838627Z #define _PSTL_PRAGMA_FORCEINLINE 2025-05-07T19:48:32.9839025Z #define _PSTL_PRAGMA_LOCATION " [Parallel STL message]: " 2025-05-07T19:48:32.9839405Z #define _PSTL_PRAGMA_MESSAGE(x) 2025-05-07T19:48:32.9840111Z #define _PSTL_PRAGMA_MESSAGE_IMPL(x) _PSTL_PRAGMA(message(_PSTL_STRING_CONCAT(_PSTL_PRAGMA_LOCATION, x))) 2025-05-07T19:48:32.9840781Z #define _PSTL_PRAGMA_MESSAGE_POLICIES(x) 2025-05-07T19:48:32.9841179Z #define _PSTL_PRAGMA_SIMD _PSTL_PRAGMA(omp simd) 2025-05-07T19:48:32.9841580Z #define _PSTL_PRAGMA_SIMD_EARLYEXIT 2025-05-07T19:48:32.9841935Z #define _PSTL_PRAGMA_SIMD_EXCLUSIVE_SCAN(PRM) 2025-05-07T19:48:32.9842343Z #define _PSTL_PRAGMA_SIMD_INCLUSIVE_SCAN(PRM) 2025-05-07T19:48:32.9842743Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC(PRM) 2025-05-07T19:48:32.9843211Z #define _PSTL_PRAGMA_SIMD_ORDERED_MONOTONIC_2ARGS(PRM1,PRM2) 2025-05-07T19:48:32.9843759Z #define _PSTL_PRAGMA_SIMD_REDUCTION(PRM) _PSTL_PRAGMA(omp simd reduction(PRM)) 2025-05-07T19:48:32.9844268Z #define _PSTL_PRAGMA_SIMD_SCAN(PRM) 2025-05-07T19:48:32.9844598Z #define _PSTL_PRAGMA_VECTOR_UNALIGNED 2025-05-07T19:48:32.9845094Z #define _PSTL_STRING(x) _PSTL_STRING_AUX(x) 2025-05-07T19:48:32.9845443Z #define _PSTL_STRING_AUX(x) #x 2025-05-07T19:48:32.9845733Z #define _PSTL_STRING_CONCAT(x,y) x #y 2025-05-07T19:48:32.9846056Z #define _PSTL_UDR_PRESENT 0 2025-05-07T19:48:32.9846497Z #define _PSTL_UDS_PRESENT (__INTEL_COMPILER >= 1900 && __INTEL_COMPILER_BUILD_DATE >= 20180626) 2025-05-07T19:48:32.9847018Z #define _PSTL_USAGE_WARNINGS 0 2025-05-07T19:48:32.9847329Z #define _PSTL_USE_NONTEMPORAL_STORES_IF_ALLOWED 2025-05-07T19:48:32.9847692Z #define _PSTL_VERSION 12000 2025-05-07T19:48:32.9847999Z #define _PSTL_VERSION_MAJOR (_PSTL_VERSION / 1000) 2025-05-07T19:48:32.9848421Z #define _PSTL_VERSION_MINOR ((_PSTL_VERSION % 1000) / 10) 2025-05-07T19:48:32.9848829Z #define _PSTL_VERSION_PATCH (_PSTL_VERSION % 10) 2025-05-07T19:48:32.9849155Z #define _PTRDIFF_T 2025-05-07T19:48:32.9849419Z #define _PTR_TRAITS_H 1 2025-05-07T19:48:32.9849673Z #define _SIGSET_H_types 1 2025-05-07T19:48:32.9850035Z #define _SIGSET_NWORDS (1024 / (8 * sizeof (unsigned long int))) 2025-05-07T19:48:32.9850402Z #define _SIZE_T 2025-05-07T19:48:32.9850656Z #define _STDC_PREDEF_H 1 2025-05-07T19:48:32.9850911Z #define _STDIO_H 1 2025-05-07T19:48:32.9851174Z #define _STDIO_USES_IOSTREAM 2025-05-07T19:48:32.9851434Z #define _STDLIB_H 1 2025-05-07T19:48:32.9851692Z #define _STL_ALGOBASE_H 1 2025-05-07T19:48:32.9851981Z #define _STL_ITERATOR_BASE_FUNCS_H 1 2025-05-07T19:48:32.9852277Z #define _STL_ITERATOR_BASE_TYPES_H 1 2025-05-07T19:48:32.9852582Z #define _STL_ITERATOR_H 1 2025-05-07T19:48:32.9852827Z #define _STL_PAIR_H 1 2025-05-07T19:48:32.9853094Z #define _STL_RELOPS_H 1 2025-05-07T19:48:32.9853336Z #define _STRING_H 1 2025-05-07T19:48:32.9853584Z #define _STRUCT_TIMEVAL 1 2025-05-07T19:48:32.9853833Z #define _SVID_SOURCE 1 2025-05-07T19:48:32.9854099Z #define _SYS_CDEFS_H 1 2025-05-07T19:48:32.9854338Z #define _SYS_SELECT_H 1 2025-05-07T19:48:32.9854605Z #define _SYS_SYSMACROS_H 1 2025-05-07T19:48:32.9854881Z #define _SYS_TYPES_H 1 2025-05-07T19:48:32.9855119Z #define _TIME_H 1 2025-05-07T19:48:32.9855370Z #define _VA_LIST_DEFINED 2025-05-07T19:48:32.9855616Z #define _XLOCALE_H 1 2025-05-07T19:48:32.9855915Z #define _XOPEN_IOV_MAX _POSIX_UIO_MAXIOV 2025-05-07T19:48:32.9856220Z #define _XOPEN_LIM_H 1 2025-05-07T19:48:32.9856490Z #define _XOPEN_SOURCE 700 2025-05-07T19:48:32.9856756Z #define _XOPEN_SOURCE_EXTENDED 1 2025-05-07T19:48:32.9857143Z #define __ASMNAME(cname) __ASMNAME2 (__USER_LABEL_PREFIX__, cname) 2025-05-07T19:48:32.9857587Z #define __ASMNAME2(prefix,cname) __STRING (prefix) cname 2025-05-07T19:48:32.9857999Z #define __ASSERT_FUNCTION __PRETTY_FUNCTION__ 2025-05-07T19:48:32.9858355Z #define __ASSERT_VOID_CAST static_cast 2025-05-07T19:48:32.9858667Z #define __ATOMIC_ACQUIRE 2 2025-05-07T19:48:32.9858949Z #define __ATOMIC_ACQ_REL 4 2025-05-07T19:48:32.9859205Z #define __ATOMIC_CONSUME 1 2025-05-07T19:48:32.9859485Z #define __ATOMIC_RELAXED 0 2025-05-07T19:48:32.9859745Z #define __ATOMIC_RELEASE 3 2025-05-07T19:48:32.9860030Z #define __ATOMIC_SEQ_CST 5 2025-05-07T19:48:32.9860299Z #define __BEGIN_DECLS extern "C" { 2025-05-07T19:48:32.9860614Z #define __BEGIN_NAMESPACE_C99 2025-05-07T19:48:32.9860973Z #define __BEGIN_NAMESPACE_STD 2025-05-07T19:48:32.9861909Z #define __BIGGEST_ALIGNMENT__ 16 2025-05-07T19:48:32.9862211Z #define __BIG_ENDIAN 4321 2025-05-07T19:48:32.9862478Z #define __BITINT_MAXWIDTH__ 8388608 2025-05-07T19:48:32.9862801Z #define __BIT_TYPES_DEFINED__ 1 2025-05-07T19:48:32.9863083Z #define __BLKCNT64_T_TYPE __SQUAD_TYPE 2025-05-07T19:48:32.9863428Z #define __BLKCNT_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:32.9863765Z #define __BLKSIZE_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:32.9864102Z #define __BOOL_WIDTH__ 8 2025-05-07T19:48:32.9864364Z #define __BYTE_ORDER __LITTLE_ENDIAN 2025-05-07T19:48:32.9864702Z #define __BYTE_ORDER__ __ORDER_LITTLE_ENDIAN__ 2025-05-07T19:48:32.9865027Z #define __CHANNEL_DESCRIPTOR_H__ 2025-05-07T19:48:32.9865349Z #define __CHAR16_TYPE__ unsigned short 2025-05-07T19:48:32.9865666Z #define __CHAR32_TYPE__ unsigned int 2025-05-07T19:48:32.9865945Z #define __CHAR_BIT__ 8 2025-05-07T19:48:32.9866230Z #define __CLANG_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:48:32.9866556Z #define __CLANG_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:48:32.9866914Z #define __CLANG_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:48:32.9867231Z #define __CLANG_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:48:32.9867559Z #define __CLANG_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:48:32.9867861Z #define __CLANG_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:48:32.9868192Z #define __CLANG_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:48:32.9868530Z #define __CLANG_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:48:32.9868848Z #define __CLANG_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:48:32.9869183Z #define __CLANG_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:48:32.9869488Z #define __CLANG_LIMITS_H 2025-05-07T19:48:32.9869744Z #define __CLANG_MAX_ALIGN_T_DEFINED 2025-05-07T19:48:32.9870010Z #define __CLOCKID_T_TYPE __S32_TYPE 2025-05-07T19:48:32.9870280Z #define __CLOCK_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:32.9870566Z #define __COMMON_FUNCTIONS_H__ 2025-05-07T19:48:32.9870804Z #define __COMPAR_FN_T 2025-05-07T19:48:32.9871028Z #define __CONCAT(x,y) x ## y 2025-05-07T19:48:32.9871271Z #define __CONSTANT_CFSTRINGS__ 1 2025-05-07T19:48:32.9871535Z #define __CUDACC_VER_BUILD__ 89 2025-05-07T19:48:32.9871770Z #define __CUDACC_VER_MAJOR__ 11 2025-05-07T19:48:32.9872016Z #define __CUDACC_VER_MINOR__ 8 2025-05-07T19:48:32.9872878Z #define __CUDACC_VER__ "__CUDACC_VER__ is no longer supported. Use __CUDACC_VER_MAJOR__, __CUDACC_VER_MINOR__, and __CUDACC_VER_BUILD__ instead." 2025-05-07T19:48:32.9873510Z #define __CUDACC__ 1 2025-05-07T19:48:32.9873755Z #define __CUDART_API_PTDS(api) api 2025-05-07T19:48:32.9874040Z #define __CUDART_API_PTSZ(api) api 2025-05-07T19:48:32.9874499Z #define __CUDART_API_VERSION ((__CUDA_API_VER_MAJOR__ * 1000) + (__CUDA_API_VER_MINOR__ * 10)) 2025-05-07T19:48:32.9874974Z #define __CUDA_API_VER_MAJOR__ 11 2025-05-07T19:48:32.9875255Z #define __CUDA_API_VER_MINOR__ 8 2025-05-07T19:48:32.9875520Z #define __CUDA_ARCH_LIST__ 520 2025-05-07T19:48:32.9875785Z #define __CUDA_ARCH__ 520 2025-05-07T19:48:32.9876050Z #define __CUDA_DEVICE_RUNTIME_API_H__ 2025-05-07T19:48:32.9876341Z #define __CUDA_MATH_CRTIMP 2025-05-07T19:48:32.9876609Z #define __CUDA_RUNTIME_API_H__ 2025-05-07T19:48:32.9876864Z #define __CUDA_RUNTIME_H__ 2025-05-07T19:48:32.9877123Z #define __CUDA_SURFACE_TYPES_H__ 2025-05-07T19:48:32.9877394Z #define __CUDA_TEXTURE_TYPES_H__ 2025-05-07T19:48:32.9877676Z #define __DADDR_T_TYPE __S32_TYPE 2025-05-07T19:48:32.9877946Z #define __DBL_DECIMAL_DIG__ 17 2025-05-07T19:48:32.9878245Z #define __DBL_DENORM_MIN__ 4.9406564584124654e-324 2025-05-07T19:48:32.9878551Z #define __DBL_DIG__ 15 2025-05-07T19:48:32.9878818Z #define __DBL_EPSILON__ 2.2204460492503131e-16 2025-05-07T19:48:32.9879140Z #define __DBL_HAS_DENORM__ 1 2025-05-07T19:48:32.9879391Z #define __DBL_HAS_INFINITY__ 1 2025-05-07T19:48:32.9879661Z #define __DBL_HAS_QUIET_NAN__ 1 2025-05-07T19:48:32.9879915Z #define __DBL_MANT_DIG__ 53 2025-05-07T19:48:32.9880171Z #define __DBL_MAX_10_EXP__ 308 2025-05-07T19:48:32.9880517Z #define __DBL_MAX_EXP__ 1024 2025-05-07T19:48:32.9880851Z #define __DBL_MAX__ 1.7976931348623157e+308 2025-05-07T19:48:32.9881142Z #define __DBL_MIN_10_EXP__ (-307) 2025-05-07T19:48:32.9881414Z #define __DBL_MIN_EXP__ (-1021) 2025-05-07T19:48:32.9881683Z #define __DBL_MIN__ 2.2250738585072014e-308 2025-05-07T19:48:32.9882002Z #define __DECIMAL_DIG__ __LDBL_DECIMAL_DIG__ 2025-05-07T19:48:32.9882315Z #define __DELETE_THROW throw() 2025-05-07T19:48:32.9882571Z #define __DEPRECATED 1 2025-05-07T19:48:32.9882825Z #define __DEVICE_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:32.9883131Z #define __DEVICE_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:32.9883434Z #define __DEVICE_DOUBLE_FUNCTIONS_HPP__ 2025-05-07T19:48:32.9883735Z #define __DEVICE_DOUBLE_FUNCTIONS_H__ 2025-05-07T19:48:32.9884036Z #define __DEVICE_FUNCTIONS_HPP__ 2025-05-07T19:48:32.9884307Z #define __DEVICE_FUNCTIONS_H__ 2025-05-07T19:48:32.9884585Z #define __DEVICE_LAUNCH_PARAMETERS_H__ 2025-05-07T19:48:32.9884891Z #define __DEVICE_TYPES_H__ 2025-05-07T19:48:32.9885265Z #define __DEV_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:32.9885525Z #define __DRIVER_FUNCTIONS_H__ 2025-05-07T19:48:32.9885766Z #define __DRIVER_TYPES_H__ 2025-05-07T19:48:32.9885995Z #define __ELF__ 1 2025-05-07T19:48:32.9886185Z #define __END_DECLS } 2025-05-07T19:48:32.9886401Z #define __END_NAMESPACE_C99 2025-05-07T19:48:32.9886635Z #define __END_NAMESPACE_STD 2025-05-07T19:48:32.9886871Z #define __EXCEPTIONS 1 2025-05-07T19:48:32.9887082Z #define __EXCEPTION_H 1 2025-05-07T19:48:32.9887325Z #define __FDS_BITS(set) ((set)->fds_bits) 2025-05-07T19:48:32.9887710Z #define __FD_CLR(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] &= ~__FD_MASK (d))) 2025-05-07T19:48:32.9888108Z #define __FD_ELT(d) ((d) / __NFDBITS) 2025-05-07T19:48:32.9888478Z #define __FD_ISSET(d,set) ((__FDS_BITS (set)[__FD_ELT (d)] & __FD_MASK (d)) != 0) 2025-05-07T19:48:32.9888902Z #define __FD_MASK(d) ((__fd_mask) 1 << ((d) % __NFDBITS)) 2025-05-07T19:48:32.9889322Z #define __FD_SET(d,set) ((void) (__FDS_BITS (set)[__FD_ELT (d)] |= __FD_MASK (d))) 2025-05-07T19:48:32.9889699Z #define __FD_SETSIZE 1024 2025-05-07T19:48:32.9890350Z #define __FD_ZERO(fdsp) do { int __d0, __d1; __asm__ __volatile__ ("cld; rep; " __FD_ZERO_STOS : "=c" (__d0), "=D" (__d1) : "a" (0), "0" (sizeof (fd_set) / sizeof (__fd_mask)), "1" (&__FDS_BITS (fdsp)[0]) : "memory"); } while (0) 2025-05-07T19:48:32.9891046Z #define __FD_ZERO_STOS "stosq" 2025-05-07T19:48:32.9891287Z #define __FILE_defined 1 2025-05-07T19:48:32.9891518Z #define __FINITE_MATH_ONLY__ 0 2025-05-07T19:48:32.9891755Z #define __FLOAT128__ 1 2025-05-07T19:48:32.9891986Z #define __FLOAT_WORD_ORDER __BYTE_ORDER 2025-05-07T19:48:32.9892263Z #define __FLT16_DECIMAL_DIG__ 5 2025-05-07T19:48:32.9892543Z #define __FLT16_DENORM_MIN__ 5.9604644775390625e-8F16 2025-05-07T19:48:32.9892832Z #define __FLT16_DIG__ 3 2025-05-07T19:48:32.9893066Z #define __FLT16_EPSILON__ 9.765625e-4F16 2025-05-07T19:48:32.9893334Z #define __FLT16_HAS_DENORM__ 1 2025-05-07T19:48:32.9893590Z #define __FLT16_HAS_INFINITY__ 1 2025-05-07T19:48:32.9894003Z #define __FLT16_HAS_QUIET_NAN__ 1 2025-05-07T19:48:32.9894256Z #define __FLT16_MANT_DIG__ 11 2025-05-07T19:48:32.9894502Z #define __FLT16_MAX_10_EXP__ 4 2025-05-07T19:48:32.9894745Z #define __FLT16_MAX_EXP__ 16 2025-05-07T19:48:32.9894985Z #define __FLT16_MAX__ 6.5504e+4F16 2025-05-07T19:48:32.9895235Z #define __FLT16_MIN_10_EXP__ (-4) 2025-05-07T19:48:32.9895490Z #define __FLT16_MIN_EXP__ (-13) 2025-05-07T19:48:32.9895730Z #define __FLT16_MIN__ 6.103515625e-5F16 2025-05-07T19:48:32.9896011Z #define __FLT_DECIMAL_DIG__ 9 2025-05-07T19:48:32.9896257Z #define __FLT_DENORM_MIN__ 1.40129846e-45F 2025-05-07T19:48:32.9896536Z #define __FLT_DIG__ 6 2025-05-07T19:48:32.9896757Z #define __FLT_EPSILON__ 1.19209290e-7F 2025-05-07T19:48:32.9897019Z #define __FLT_HAS_DENORM__ 1 2025-05-07T19:48:32.9897254Z #define __FLT_HAS_INFINITY__ 1 2025-05-07T19:48:32.9897489Z #define __FLT_HAS_QUIET_NAN__ 1 2025-05-07T19:48:32.9897731Z #define __FLT_MANT_DIG__ 24 2025-05-07T19:48:32.9897956Z #define __FLT_MAX_10_EXP__ 38 2025-05-07T19:48:32.9898282Z #define __FLT_MAX_EXP__ 128 2025-05-07T19:48:32.9898567Z #define __FLT_MAX__ 3.40282347e+38F 2025-05-07T19:48:32.9898822Z #define __FLT_MIN_10_EXP__ (-37) 2025-05-07T19:48:32.9899066Z #define __FLT_MIN_EXP__ (-125) 2025-05-07T19:48:32.9899312Z #define __FLT_MIN__ 1.17549435e-38F 2025-05-07T19:48:32.9899561Z #define __FLT_RADIX__ 2 2025-05-07T19:48:32.9899790Z #define __FSBLKCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:32.9900100Z #define __FSBLKCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:32.9900398Z #define __FSFILCNT64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:32.9900699Z #define __FSFILCNT_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:32.9901003Z #define __FSID_T_TYPE struct { int __val[2]; } 2025-05-07T19:48:32.9901313Z #define __FSWORD_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:32.9901582Z #define __FXSR__ 1 2025-05-07T19:48:32.9901792Z #define __GCC_ASM_FLAG_OUTPUTS__ 1 2025-05-07T19:48:32.9902062Z #define __GCC_ATOMIC_BOOL_LOCK_FREE 2 2025-05-07T19:48:32.9902335Z #define __GCC_ATOMIC_CHAR16_T_LOCK_FREE 2 2025-05-07T19:48:32.9902629Z #define __GCC_ATOMIC_CHAR32_T_LOCK_FREE 2 2025-05-07T19:48:32.9902911Z #define __GCC_ATOMIC_CHAR_LOCK_FREE 2 2025-05-07T19:48:32.9903187Z #define __GCC_ATOMIC_INT_LOCK_FREE 2 2025-05-07T19:48:32.9903451Z #define __GCC_ATOMIC_LLONG_LOCK_FREE 2 2025-05-07T19:48:32.9903725Z #define __GCC_ATOMIC_LONG_LOCK_FREE 2 2025-05-07T19:48:32.9903998Z #define __GCC_ATOMIC_POINTER_LOCK_FREE 2 2025-05-07T19:48:32.9904292Z #define __GCC_ATOMIC_SHORT_LOCK_FREE 2 2025-05-07T19:48:32.9904585Z #define __GCC_ATOMIC_TEST_AND_SET_TRUEVAL 1 2025-05-07T19:48:32.9904872Z #define __GCC_ATOMIC_WCHAR_T_LOCK_FREE 2 2025-05-07T19:48:32.9905157Z #define __GCC_HAVE_DWARF2_CFI_ASM 1 2025-05-07T19:48:32.9905432Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1 1 2025-05-07T19:48:32.9905749Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2 1 2025-05-07T19:48:32.9906045Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4 1 2025-05-07T19:48:32.9906347Z #define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8 1 2025-05-07T19:48:32.9906623Z #define __GID_T_TYPE __U32_TYPE 2025-05-07T19:48:32.9906880Z #define __GLIBCXX_BITSIZE_INT_N_0 128 2025-05-07T19:48:32.9907147Z #define __GLIBCXX_TYPE_INT_N_0 __int128 2025-05-07T19:48:32.9907417Z #define __GLIBCXX__ 20230528 2025-05-07T19:48:32.9907658Z #define __GLIBC_HAVE_LONG_LONG 1 2025-05-07T19:48:32.9907897Z #define __GLIBC_MINOR__ 17 2025-05-07T19:48:32.9908284Z #define __GLIBC_PREREQ(maj,min) ((__GLIBC__ << 16) + __GLIBC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:48:32.9908705Z #define __GLIBC__ 2 2025-05-07T19:48:32.9908934Z #define __GNUC_GNU_INLINE__ 1 2025-05-07T19:48:32.9909181Z #define __GNUC_MINOR__ 2 2025-05-07T19:48:32.9909435Z #define __GNUC_PATCHLEVEL__ 1 2025-05-07T19:48:32.9909817Z #define __GNUC_PREREQ(maj,min) ((__GNUC__ << 16) + __GNUC_MINOR__ >= ((maj) << 16) + (min)) 2025-05-07T19:48:32.9910247Z #define __GNUC_VA_LIST 2025-05-07T19:48:32.9910488Z #define __GNUC__ 4 2025-05-07T19:48:32.9910709Z #define __GNUG__ 4 2025-05-07T19:48:32.9910966Z #define __GNU_LIBRARY__ 6 2025-05-07T19:48:32.9911235Z #define __GXX_ABI_VERSION 1002 2025-05-07T19:48:32.9911539Z #define __GXX_EXPERIMENTAL_CXX0X__ 1 2025-05-07T19:48:32.9911822Z #define __GXX_RTTI 1 2025-05-07T19:48:32.9912074Z #define __GXX_WEAK__ 1 2025-05-07T19:48:32.9912399Z #define __HAVE_COLUMN 2025-05-07T19:48:32.9912679Z #define __HOST_CONFIG_H__ 2025-05-07T19:48:32.9913133Z #define __HOST_DEFINES_H__ 2025-05-07T19:48:32.9913440Z #define __ID_T_TYPE __U32_TYPE 2025-05-07T19:48:32.9913800Z #define __INO64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:32.9914152Z #define __INO_T_MATCHES_INO64_T 1 2025-05-07T19:48:32.9914506Z #define __INO_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:32.9914833Z #define __INT16_C_SUFFIX__ 2025-05-07T19:48:32.9915131Z #define __INT16_FMTd__ "hd" 2025-05-07T19:48:32.9915411Z #define __INT16_FMTi__ "hi" 2025-05-07T19:48:32.9915708Z #define __INT16_MAX__ 32767 2025-05-07T19:48:32.9915986Z #define __INT16_TYPE__ short 2025-05-07T19:48:32.9916289Z #define __INT32_C_SUFFIX__ 2025-05-07T19:48:32.9916660Z #define __INT32_FMTd__ "d" 2025-05-07T19:48:32.9917016Z #define __INT32_FMTi__ "i" 2025-05-07T19:48:32.9917293Z #define __INT32_MAX__ 2147483647 2025-05-07T19:48:32.9917613Z #define __INT32_TYPE__ int 2025-05-07T19:48:32.9917916Z #define __INT64_C_SUFFIX__ L 2025-05-07T19:48:32.9918196Z #define __INT64_FMTd__ "ld" 2025-05-07T19:48:32.9918495Z #define __INT64_FMTi__ "li" 2025-05-07T19:48:32.9918782Z #define __INT64_MAX__ 9223372036854775807L 2025-05-07T19:48:32.9919131Z #define __INT64_TYPE__ long int 2025-05-07T19:48:32.9919415Z #define __INT8_C_SUFFIX__ 2025-05-07T19:48:32.9919713Z #define __INT8_FMTd__ "hhd" 2025-05-07T19:48:32.9919986Z #define __INT8_FMTi__ "hhi" 2025-05-07T19:48:32.9920274Z #define __INT8_MAX__ 127 2025-05-07T19:48:32.9920546Z #define __INT8_TYPE__ signed char 2025-05-07T19:48:32.9920870Z #define __INTMAX_C_SUFFIX__ L 2025-05-07T19:48:32.9921179Z #define __INTMAX_FMTd__ "ld" 2025-05-07T19:48:32.9921457Z #define __INTMAX_FMTi__ "li" 2025-05-07T19:48:32.9921775Z #define __INTMAX_MAX__ 9223372036854775807L 2025-05-07T19:48:32.9922106Z #define __INTMAX_TYPE__ long int 2025-05-07T19:48:32.9922431Z #define __INTMAX_WIDTH__ 64 2025-05-07T19:48:32.9922707Z #define __INTPTR_FMTd__ "ld" 2025-05-07T19:48:32.9923008Z #define __INTPTR_FMTi__ "li" 2025-05-07T19:48:32.9923304Z #define __INTPTR_MAX__ 9223372036854775807L 2025-05-07T19:48:32.9923649Z #define __INTPTR_TYPE__ long int 2025-05-07T19:48:32.9923946Z #define __INTPTR_WIDTH__ 64 2025-05-07T19:48:32.9924252Z #define __INT_FAST16_FMTd__ "hd" 2025-05-07T19:48:32.9924563Z #define __INT_FAST16_FMTi__ "hi" 2025-05-07T19:48:32.9924857Z #define __INT_FAST16_MAX__ 32767 2025-05-07T19:48:32.9925293Z #define __INT_FAST16_TYPE__ short 2025-05-07T19:48:32.9925398Z #define __INT_FAST16_WIDTH__ 16 2025-05-07T19:48:32.9925500Z #define __INT_FAST32_FMTd__ "d" 2025-05-07T19:48:32.9925627Z #define __INT_FAST32_FMTi__ "i" 2025-05-07T19:48:32.9925731Z #define __INT_FAST32_MAX__ 2147483647 2025-05-07T19:48:32.9925832Z #define __INT_FAST32_TYPE__ int 2025-05-07T19:48:32.9925938Z #define __INT_FAST32_WIDTH__ 32 2025-05-07T19:48:32.9926066Z #define __INT_FAST64_FMTd__ "ld" 2025-05-07T19:48:32.9926171Z #define __INT_FAST64_FMTi__ "li" 2025-05-07T19:48:32.9926301Z #define __INT_FAST64_MAX__ 9223372036854775807L 2025-05-07T19:48:32.9926431Z #define __INT_FAST64_TYPE__ long int 2025-05-07T19:48:32.9926526Z #define __INT_FAST64_WIDTH__ 64 2025-05-07T19:48:32.9926628Z #define __INT_FAST8_FMTd__ "hhd" 2025-05-07T19:48:32.9926724Z #define __INT_FAST8_FMTi__ "hhi" 2025-05-07T19:48:32.9926846Z #define __INT_FAST8_MAX__ 127 2025-05-07T19:48:32.9926954Z #define __INT_FAST8_TYPE__ signed char 2025-05-07T19:48:32.9927052Z #define __INT_FAST8_WIDTH__ 8 2025-05-07T19:48:32.9927177Z #define __INT_LEAST16_FMTd__ "hd" 2025-05-07T19:48:32.9927276Z #define __INT_LEAST16_FMTi__ "hi" 2025-05-07T19:48:32.9927374Z #define __INT_LEAST16_MAX__ 32767 2025-05-07T19:48:32.9927478Z #define __INT_LEAST16_TYPE__ short 2025-05-07T19:48:32.9927600Z #define __INT_LEAST16_WIDTH__ 16 2025-05-07T19:48:32.9927701Z #define __INT_LEAST32_FMTd__ "d" 2025-05-07T19:48:32.9927800Z #define __INT_LEAST32_FMTi__ "i" 2025-05-07T19:48:32.9927926Z #define __INT_LEAST32_MAX__ 2147483647 2025-05-07T19:48:32.9928015Z #define __INT_LEAST32_TYPE__ int 2025-05-07T19:48:32.9928097Z #define __INT_LEAST32_WIDTH__ 32 2025-05-07T19:48:32.9928183Z #define __INT_LEAST64_FMTd__ "ld" 2025-05-07T19:48:32.9928275Z #define __INT_LEAST64_FMTi__ "li" 2025-05-07T19:48:32.9928381Z #define __INT_LEAST64_MAX__ 9223372036854775807L 2025-05-07T19:48:32.9928471Z #define __INT_LEAST64_TYPE__ long int 2025-05-07T19:48:32.9928573Z #define __INT_LEAST64_WIDTH__ 64 2025-05-07T19:48:32.9928660Z #define __INT_LEAST8_FMTd__ "hhd" 2025-05-07T19:48:32.9928748Z #define __INT_LEAST8_FMTi__ "hhi" 2025-05-07T19:48:32.9928833Z #define __INT_LEAST8_MAX__ 127 2025-05-07T19:48:32.9928934Z #define __INT_LEAST8_TYPE__ signed char 2025-05-07T19:48:32.9929019Z #define __INT_LEAST8_WIDTH__ 8 2025-05-07T19:48:32.9929097Z #define __INT_MAX__ 2147483647 2025-05-07T19:48:32.9929260Z #define __INT_WIDTH__ 32 2025-05-07T19:48:32.9929402Z #define __KERNEL_STRICT_NAMES 2025-05-07T19:48:32.9929484Z #define __KEY_T_TYPE __S32_TYPE 2025-05-07T19:48:32.9929570Z #define __LDBL_DECIMAL_DIG__ 21 2025-05-07T19:48:32.9929708Z #define __LDBL_DENORM_MIN__ 3.64519953188247460253e-4951L 2025-05-07T19:48:32.9929785Z #define __LDBL_DIG__ 18 2025-05-07T19:48:32.9929904Z #define __LDBL_EPSILON__ 1.08420217248550443401e-19L 2025-05-07T19:48:32.9929995Z #define __LDBL_HAS_DENORM__ 1 2025-05-07T19:48:32.9930078Z #define __LDBL_HAS_INFINITY__ 1 2025-05-07T19:48:32.9930164Z #define __LDBL_HAS_QUIET_NAN__ 1 2025-05-07T19:48:32.9930246Z #define __LDBL_MANT_DIG__ 64 2025-05-07T19:48:32.9930336Z #define __LDBL_MAX_10_EXP__ 4932 2025-05-07T19:48:32.9930419Z #define __LDBL_MAX_EXP__ 16384 2025-05-07T19:48:32.9930524Z #define __LDBL_MAX__ 1.18973149535723176502e+4932L 2025-05-07T19:48:32.9930618Z #define __LDBL_MIN_10_EXP__ (-4931) 2025-05-07T19:48:32.9930703Z #define __LDBL_MIN_EXP__ (-16381) 2025-05-07T19:48:32.9930813Z #define __LDBL_MIN__ 3.36210314311209350626e-4932L 2025-05-07T19:48:32.9930930Z #define __LDBL_REDIR(name,proto) name proto 2025-05-07T19:48:32.9931055Z #define __LDBL_REDIR1(name,proto,alias) name proto 2025-05-07T19:48:32.9931375Z #define __LDBL_REDIR1_NTH(name,proto,alias) name proto __THROW 2025-05-07T19:48:32.9931464Z #define __LDBL_REDIR_DECL(name) 2025-05-07T19:48:32.9931605Z #define __LDBL_REDIR_NTH(name,proto) name proto __THROW 2025-05-07T19:48:32.9931677Z #define __LEAF 2025-05-07T19:48:32.9931753Z #define __LEAF_ATTR 2025-05-07T19:48:32.9932029Z #define __LIBRARY_TYPES_H__ 2025-05-07T19:48:32.9932120Z #define __LITTLE_ENDIAN 1234 2025-05-07T19:48:32.9932210Z #define __LITTLE_ENDIAN__ 1 2025-05-07T19:48:32.9932298Z #define __LLONG_WIDTH__ 64 2025-05-07T19:48:32.9932417Z #define __LONG_LONG_MAX__ 9223372036854775807LL 2025-05-07T19:48:32.9932520Z #define __LONG_LONG_PAIR(HI,LO) LO, HI 2025-05-07T19:48:32.9932616Z #define __LONG_MAX__ 9223372036854775807L 2025-05-07T19:48:32.9932712Z #define __LONG_WIDTH__ 64 2025-05-07T19:48:32.9932796Z #define __LP64__ 1 2025-05-07T19:48:32.9933130Z #define __MATHCALLX(function,suffix,args,attrib) __MATHDECLX (_Mdouble_,function,suffix, args, attrib) 2025-05-07T19:48:32.9933804Z #define __MATHDECLX(type,function,suffix,args,attrib) __MATHDECL_1(type, function,suffix, args) __attribute__ (attrib); __MATHDECL_1(type, __CONCAT(__,function),suffix, args) __attribute__ (attrib) 2025-05-07T19:48:32.9933909Z #define __MATH_DECLARE_LDOUBLE 1 2025-05-07T19:48:32.9934004Z #define __MATH_FUNCTIONS_HPP__ 2025-05-07T19:48:32.9934095Z #define __MATH_FUNCTIONS_H__ 2025-05-07T19:48:32.9934180Z #define __MMX__ 1 2025-05-07T19:48:32.9934272Z #define __MODE_T_TYPE __U32_TYPE 2025-05-07T19:48:32.9934359Z #define __N(msgid) (msgid) 2025-05-07T19:48:32.9934486Z #define __NFDBITS (8 * (int) sizeof (__fd_mask)) 2025-05-07T19:48:32.9934597Z #define __NLINK_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:32.9934675Z #define __NO_CTYPE 1 2025-05-07T19:48:32.9934756Z #define __NO_INLINE__ 1 2025-05-07T19:48:32.9934853Z #define __NO_MATH_INLINES 1 2025-05-07T19:48:32.9934961Z #define __NTH(fct) __LEAF_ATTR fct throw () 2025-05-07T19:48:32.9935062Z #define __NVCC_DIAG_PRAGMA_SUPPORT__ 1 2025-05-07T19:48:32.9935145Z #define __NVCC__ 1 2025-05-07T19:48:32.9935239Z #define __NV_GLIBCXX_VERSION 40800 2025-05-07T19:48:32.9935336Z #define __NV_NO_HOST_COMPILER_CHECK 1 2025-05-07T19:48:32.9935424Z #define __OBJC_BOOL_IS_BOOL 0 2025-05-07T19:48:32.9935526Z #define __OFF64_T_TYPE __SQUAD_TYPE 2025-05-07T19:48:32.9935619Z #define __OFF_T_MATCHES_OFF64_T 1 2025-05-07T19:48:32.9935723Z #define __OFF_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:32.9935852Z #define __OPENCL_MEMORY_SCOPE_ALL_SVM_DEVICES 3 2025-05-07T19:48:32.9935951Z #define __OPENCL_MEMORY_SCOPE_DEVICE 2 2025-05-07T19:48:32.9936052Z #define __OPENCL_MEMORY_SCOPE_SUB_GROUP 4 2025-05-07T19:48:32.9936159Z #define __OPENCL_MEMORY_SCOPE_WORK_GROUP 1 2025-05-07T19:48:32.9936269Z #define __OPENCL_MEMORY_SCOPE_WORK_ITEM 0 2025-05-07T19:48:32.9936465Z #define __ORDER_BIG_ENDIAN__ 4321 2025-05-07T19:48:32.9936631Z #define __ORDER_LITTLE_ENDIAN__ 1234 2025-05-07T19:48:32.9936730Z #define __ORDER_PDP_ENDIAN__ 3412 2025-05-07T19:48:32.9936812Z #define __P(args) args 2025-05-07T19:48:32.9936901Z #define __PDP_ENDIAN 3412 2025-05-07T19:48:32.9936978Z #define __PIC__ 2 2025-05-07T19:48:32.9937079Z #define __PID_T_TYPE __S32_TYPE 2025-05-07T19:48:32.9937157Z #define __PIE__ 2 2025-05-07T19:48:32.9937240Z #define __PMT(args) args 2025-05-07T19:48:32.9937340Z #define __POINTER_WIDTH__ 64 2025-05-07T19:48:32.9937437Z #define __PRAGMA_REDEFINE_EXTNAME 1 2025-05-07T19:48:32.9937531Z #define __PTHREAD_MUTEX_HAVE_PREV 1 2025-05-07T19:48:32.9937640Z #define __PTHREAD_RWLOCK_INT_FLAGS_SHARED 1 2025-05-07T19:48:32.9937741Z #define __PTHREAD_SPINS 0, 0 2025-05-07T19:48:32.9937835Z #define __PTRDIFF_FMTd__ "ld" 2025-05-07T19:48:32.9937924Z #define __PTRDIFF_FMTi__ "li" 2025-05-07T19:48:32.9938043Z #define __PTRDIFF_MAX__ 9223372036854775807L 2025-05-07T19:48:32.9938139Z #define __PTRDIFF_TYPE__ long int 2025-05-07T19:48:32.9938232Z #define __PTRDIFF_WIDTH__ 64 2025-05-07T19:48:32.9938455Z #define __REDIRECT(name,proto,alias) name proto __asm__ (__ASMNAME (#alias)) 2025-05-07T19:48:32.9938680Z #define __REDIRECT_LDBL(name,proto,alias) __REDIRECT (name, proto, alias) 2025-05-07T19:48:32.9938939Z #define __REDIRECT_NTH(name,proto,alias) name proto __THROW __asm__ (__ASMNAME (#alias)) 2025-05-07T19:48:32.9939212Z #define __REDIRECT_NTHNL(name,proto,alias) name proto __THROWNL __asm__ (__ASMNAME (#alias)) 2025-05-07T19:48:32.9939462Z #define __REDIRECT_NTH_LDBL(name,proto,alias) __REDIRECT_NTH (name, proto, alias) 2025-05-07T19:48:32.9939555Z #define __REGISTER_PREFIX__ 2025-05-07T19:48:32.9939654Z #define __RLIM64_T_TYPE __UQUAD_TYPE 2025-05-07T19:48:32.9939771Z #define __RLIM_T_TYPE __SYSCALL_ULONG_TYPE 2025-05-07T19:48:32.9939866Z #define __S16_TYPE short int 2025-05-07T19:48:32.9939945Z #define __S32_TYPE int 2025-05-07T19:48:32.9940030Z #define __S64_TYPE long int 2025-05-07T19:48:32.9940130Z #define __SCHAR_MAX__ 127 2025-05-07T19:48:32.9940212Z #define __SEG_FS 1 2025-05-07T19:48:32.9940288Z #define __SEG_GS 1 2025-05-07T19:48:32.9940379Z #define __SHRT_MAX__ 32767 2025-05-07T19:48:32.9940463Z #define __SHRT_WIDTH__ 16 2025-05-07T19:48:32.9940558Z #define __SIG_ATOMIC_MAX__ 2147483647 2025-05-07T19:48:32.9940649Z #define __SIG_ATOMIC_WIDTH__ 32 2025-05-07T19:48:32.9940744Z #define __SIZEOF_DOUBLE__ 8 2025-05-07T19:48:32.9940837Z #define __SIZEOF_FLOAT128__ 16 2025-05-07T19:48:32.9940924Z #define __SIZEOF_FLOAT__ 4 2025-05-07T19:48:32.9941021Z #define __SIZEOF_INT128__ 16 2025-05-07T19:48:32.9941106Z #define __SIZEOF_INT__ 4 2025-05-07T19:48:32.9941199Z #define __SIZEOF_LONG_DOUBLE__ 16 2025-05-07T19:48:32.9941292Z #define __SIZEOF_LONG_LONG__ 8 2025-05-07T19:48:32.9941385Z #define __SIZEOF_LONG__ 8 2025-05-07T19:48:32.9941473Z #define __SIZEOF_POINTER__ 8 2025-05-07T19:48:32.9941568Z #define __SIZEOF_PTHREAD_ATTR_T 56 2025-05-07T19:48:32.9941683Z #define __SIZEOF_PTHREAD_BARRIERATTR_T 4 2025-05-07T19:48:32.9941786Z #define __SIZEOF_PTHREAD_BARRIER_T 32 2025-05-07T19:48:32.9941886Z #define __SIZEOF_PTHREAD_CONDATTR_T 4 2025-05-07T19:48:32.9941981Z #define __SIZEOF_PTHREAD_COND_T 48 2025-05-07T19:48:32.9942091Z #define __SIZEOF_PTHREAD_MUTEXATTR_T 4 2025-05-07T19:48:32.9942186Z #define __SIZEOF_PTHREAD_MUTEX_T 40 2025-05-07T19:48:32.9942288Z #define __SIZEOF_PTHREAD_RWLOCKATTR_T 8 2025-05-07T19:48:32.9942397Z #define __SIZEOF_PTHREAD_RWLOCK_T 56 2025-05-07T19:48:32.9942489Z #define __SIZEOF_PTRDIFF_T__ 8 2025-05-07T19:48:32.9942579Z #define __SIZEOF_SHORT__ 2 2025-05-07T19:48:32.9942684Z #define __SIZEOF_SIZE_T__ 8 2025-05-07T19:48:32.9942774Z #define __SIZEOF_WCHAR_T__ 4 2025-05-07T19:48:32.9942861Z #define __SIZEOF_WINT_T__ 4 2025-05-07T19:48:32.9942949Z #define __SIZE_FMTX__ "lX" 2025-05-07T19:48:32.9943047Z #define __SIZE_FMTo__ "lo" 2025-05-07T19:48:32.9943131Z #define __SIZE_FMTu__ "lu" 2025-05-07T19:48:32.9943214Z #define __SIZE_FMTx__ "lx" 2025-05-07T19:48:32.9943519Z #define __SIZE_MAX__ 18446744073709551615UL 2025-05-07T19:48:32.9943666Z #define __SIZE_TYPE__ long unsigned int 2025-05-07T19:48:32.9943747Z #define __SIZE_WIDTH__ 64 2025-05-07T19:48:32.9943833Z #define __SLONG32_TYPE int 2025-05-07T19:48:32.9943942Z #define __SLONGWORD_TYPE long int 2025-05-07T19:48:32.9944160Z #define __SM_20_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:32.9944254Z #define __SM_20_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:32.9944355Z #define __SM_20_INTRINSICS_HPP__ 2025-05-07T19:48:32.9944439Z #define __SM_20_INTRINSICS_H__ 2025-05-07T19:48:32.9944530Z #define __SM_30_INTRINSICS_HPP__ 2025-05-07T19:48:32.9944614Z #define __SM_30_INTRINSICS_H__ 2025-05-07T19:48:32.9944715Z #define __SM_32_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:32.9944804Z #define __SM_32_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:32.9944890Z #define __SM_32_INTRINSICS_HPP__ 2025-05-07T19:48:32.9944986Z #define __SM_32_INTRINSICS_H__ 2025-05-07T19:48:32.9945080Z #define __SM_35_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:32.9945166Z #define __SM_35_INTRINSICS_H__ 2025-05-07T19:48:32.9945260Z #define __SM_60_ATOMIC_FUNCTIONS_HPP__ 2025-05-07T19:48:32.9945356Z #define __SM_60_ATOMIC_FUNCTIONS_H__ 2025-05-07T19:48:32.9945440Z #define __SM_61_INTRINSICS_HPP__ 2025-05-07T19:48:32.9945525Z #define __SM_61_INTRINSICS_H__ 2025-05-07T19:48:32.9945612Z #define __SM_70_RT_HPP__ 2025-05-07T19:48:32.9945687Z #define __SM_70_RT_H__ 2025-05-07T19:48:32.9945762Z #define __SM_80_RT_HPP__ 2025-05-07T19:48:32.9945836Z #define __SM_80_RT_H__ 2025-05-07T19:48:32.9945929Z #define __SM_90_RT_HPP__ 2025-05-07T19:48:32.9946002Z #define __SM_90_RT_H__ 2025-05-07T19:48:32.9946085Z #define __SQUAD_TYPE long int 2025-05-07T19:48:32.9946166Z #define __SSE2_MATH__ 1 2025-05-07T19:48:32.9946240Z #define __SSE2__ 1 2025-05-07T19:48:32.9946316Z #define __SSE_MATH__ 1 2025-05-07T19:48:32.9946387Z #define __SSE__ 1 2025-05-07T19:48:32.9946487Z #define __SSIZE_T_TYPE __SWORD_TYPE 2025-05-07T19:48:32.9946598Z #define __STDCPP_DEFAULT_NEW_ALIGNMENT__ 16UL 2025-05-07T19:48:32.9946700Z #define __STDCPP_MATH_SPEC_FUNCS__ 201003L 2025-05-07T19:48:32.9946794Z #define __STDCPP_THREADS__ 1 2025-05-07T19:48:32.9946879Z #define __STDC_HOSTED__ 1 2025-05-07T19:48:32.9946966Z #define __STDC_IEC_559_COMPLEX__ 1 2025-05-07T19:48:32.9947046Z #define __STDC_IEC_559__ 1 2025-05-07T19:48:32.9947136Z #define __STDC_ISO_10646__ 201103L 2025-05-07T19:48:32.9947220Z #define __STDC_NO_THREADS__ 1 2025-05-07T19:48:32.9947296Z #define __STDC_UTF_16__ 1 2025-05-07T19:48:32.9947381Z #define __STDC_UTF_32__ 1 2025-05-07T19:48:32.9947454Z #define __STDC__ 1 2025-05-07T19:48:32.9947525Z #define __STDDEF_H 2025-05-07T19:48:32.9947601Z #define __STRING(x) #x 2025-05-07T19:48:32.9947704Z #define __SURFACE_FUNCTIONS_H__ 2025-05-07T19:48:32.9947810Z #define __SURFACE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:48:32.9947895Z #define __SURFACE_TYPES_H__ 2025-05-07T19:48:32.9948017Z #define __SUSECONDS_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:32.9948102Z #define __SWORD_TYPE long int 2025-05-07T19:48:32.9948215Z #define __SYSCALL_SLONG_TYPE __SLONGWORD_TYPE 2025-05-07T19:48:32.9948325Z #define __SYSCALL_ULONG_TYPE __ULONGWORD_TYPE 2025-05-07T19:48:32.9948415Z #define __SYSCALL_WORDSIZE 64 2025-05-07T19:48:32.9948508Z #define __TEXTURE_FETCH_FUNCTIONS_H__ 2025-05-07T19:48:32.9948603Z #define __TEXTURE_INDIRECT_FUNCTIONS_H__ 2025-05-07T19:48:32.9948694Z #define __TEXTURE_TYPES_H__ 2025-05-07T19:48:32.9948772Z #define __THROW throw () 2025-05-07T19:48:32.9948855Z #define __THROWNL throw () 2025-05-07T19:48:32.9948937Z #define __TIMER_T_TYPE void * 2025-05-07T19:48:32.9949041Z #define __TIME_T_TYPE __SYSCALL_SLONG_TYPE 2025-05-07T19:48:32.9949133Z #define __U16_TYPE unsigned short int 2025-05-07T19:48:32.9949219Z #define __U32_TYPE unsigned int 2025-05-07T19:48:32.9949312Z #define __U64_TYPE unsigned long int 2025-05-07T19:48:32.9949395Z #define __UID_T_TYPE __U32_TYPE 2025-05-07T19:48:32.9949478Z #define __UINT16_C_SUFFIX__ 2025-05-07T19:48:32.9949562Z #define __UINT16_FMTX__ "hX" 2025-05-07T19:48:32.9949705Z #define __UINT16_FMTo__ "ho" 2025-05-07T19:48:32.9949841Z #define __UINT16_FMTu__ "hu" 2025-05-07T19:48:32.9949919Z #define __UINT16_FMTx__ "hx" 2025-05-07T19:48:32.9950002Z #define __UINT16_MAX__ 65535 2025-05-07T19:48:32.9950092Z #define __UINT16_TYPE__ unsigned short 2025-05-07T19:48:32.9950173Z #define __UINT32_C_SUFFIX__ U 2025-05-07T19:48:32.9950252Z #define __UINT32_FMTX__ "X" 2025-05-07T19:48:32.9950337Z #define __UINT32_FMTo__ "o" 2025-05-07T19:48:32.9950415Z #define __UINT32_FMTu__ "u" 2025-05-07T19:48:32.9950494Z #define __UINT32_FMTx__ "x" 2025-05-07T19:48:32.9950590Z #define __UINT32_MAX__ 4294967295U 2025-05-07T19:48:32.9950679Z #define __UINT32_TYPE__ unsigned int 2025-05-07T19:48:32.9950763Z #define __UINT64_C_SUFFIX__ UL 2025-05-07T19:48:32.9950847Z #define __UINT64_FMTX__ "lX" 2025-05-07T19:48:32.9950932Z #define __UINT64_FMTo__ "lo" 2025-05-07T19:48:32.9951014Z #define __UINT64_FMTu__ "lu" 2025-05-07T19:48:32.9951094Z #define __UINT64_FMTx__ "lx" 2025-05-07T19:48:32.9951204Z #define __UINT64_MAX__ 18446744073709551615UL 2025-05-07T19:48:32.9951303Z #define __UINT64_TYPE__ long unsigned int 2025-05-07T19:48:32.9951384Z #define __UINT8_C_SUFFIX__ 2025-05-07T19:48:32.9951471Z #define __UINT8_FMTX__ "hhX" 2025-05-07T19:48:32.9951558Z #define __UINT8_FMTo__ "hho" 2025-05-07T19:48:32.9951637Z #define __UINT8_FMTu__ "hhu" 2025-05-07T19:48:32.9951720Z #define __UINT8_FMTx__ "hhx" 2025-05-07T19:48:32.9951810Z #define __UINT8_MAX__ 255 2025-05-07T19:48:32.9951898Z #define __UINT8_TYPE__ unsigned char 2025-05-07T19:48:32.9951980Z #define __UINTMAX_C_SUFFIX__ UL 2025-05-07T19:48:32.9952069Z #define __UINTMAX_FMTX__ "lX" 2025-05-07T19:48:32.9952159Z #define __UINTMAX_FMTo__ "lo" 2025-05-07T19:48:32.9952240Z #define __UINTMAX_FMTu__ "lu" 2025-05-07T19:48:32.9952410Z #define __UINTMAX_FMTx__ "lx" 2025-05-07T19:48:32.9952522Z #define __UINTMAX_MAX__ 18446744073709551615UL 2025-05-07T19:48:32.9952620Z #define __UINTMAX_TYPE__ long unsigned int 2025-05-07T19:48:32.9952875Z #define __UINTMAX_WIDTH__ 64 2025-05-07T19:48:32.9952984Z #define __UINTPTR_FMTX__ "lX" 2025-05-07T19:48:32.9953082Z #define __UINTPTR_FMTo__ "lo" 2025-05-07T19:48:32.9953171Z #define __UINTPTR_FMTu__ "lu" 2025-05-07T19:48:32.9953260Z #define __UINTPTR_FMTx__ "lx" 2025-05-07T19:48:32.9953377Z #define __UINTPTR_MAX__ 18446744073709551615UL 2025-05-07T19:48:32.9953485Z #define __UINTPTR_TYPE__ long unsigned int 2025-05-07T19:48:32.9953576Z #define __UINTPTR_WIDTH__ 64 2025-05-07T19:48:32.9953679Z #define __UINT_FAST16_FMTX__ "hX" 2025-05-07T19:48:32.9953771Z #define __UINT_FAST16_FMTo__ "ho" 2025-05-07T19:48:32.9953864Z #define __UINT_FAST16_FMTu__ "hu" 2025-05-07T19:48:32.9953957Z #define __UINT_FAST16_FMTx__ "hx" 2025-05-07T19:48:32.9954055Z #define __UINT_FAST16_MAX__ 65535 2025-05-07T19:48:32.9954163Z #define __UINT_FAST16_TYPE__ unsigned short 2025-05-07T19:48:32.9954253Z #define __UINT_FAST32_FMTX__ "X" 2025-05-07T19:48:32.9954349Z #define __UINT_FAST32_FMTo__ "o" 2025-05-07T19:48:32.9954437Z #define __UINT_FAST32_FMTu__ "u" 2025-05-07T19:48:32.9954527Z #define __UINT_FAST32_FMTx__ "x" 2025-05-07T19:48:32.9954626Z #define __UINT_FAST32_MAX__ 4294967295U 2025-05-07T19:48:32.9954738Z #define __UINT_FAST32_TYPE__ unsigned int 2025-05-07T19:48:32.9954827Z #define __UINT_FAST64_FMTX__ "lX" 2025-05-07T19:48:32.9954918Z #define __UINT_FAST64_FMTo__ "lo" 2025-05-07T19:48:32.9955013Z #define __UINT_FAST64_FMTu__ "lu" 2025-05-07T19:48:32.9955101Z #define __UINT_FAST64_FMTx__ "lx" 2025-05-07T19:48:32.9955219Z #define __UINT_FAST64_MAX__ 18446744073709551615UL 2025-05-07T19:48:32.9955335Z #define __UINT_FAST64_TYPE__ long unsigned int 2025-05-07T19:48:32.9955435Z #define __UINT_FAST8_FMTX__ "hhX" 2025-05-07T19:48:32.9955523Z #define __UINT_FAST8_FMTo__ "hho" 2025-05-07T19:48:32.9955615Z #define __UINT_FAST8_FMTu__ "hhu" 2025-05-07T19:48:32.9955714Z #define __UINT_FAST8_FMTx__ "hhx" 2025-05-07T19:48:32.9955806Z #define __UINT_FAST8_MAX__ 255 2025-05-07T19:48:32.9955905Z #define __UINT_FAST8_TYPE__ unsigned char 2025-05-07T19:48:32.9956064Z #define __UINT_LEAST16_FMTX__ "hX" 2025-05-07T19:48:32.9956219Z #define __UINT_LEAST16_FMTo__ "ho" 2025-05-07T19:48:32.9956308Z #define __UINT_LEAST16_FMTu__ "hu" 2025-05-07T19:48:32.9956396Z #define __UINT_LEAST16_FMTx__ "hx" 2025-05-07T19:48:32.9956497Z #define __UINT_LEAST16_MAX__ 65535 2025-05-07T19:48:32.9956610Z #define __UINT_LEAST16_TYPE__ unsigned short 2025-05-07T19:48:32.9956701Z #define __UINT_LEAST32_FMTX__ "X" 2025-05-07T19:48:32.9956803Z #define __UINT_LEAST32_FMTo__ "o" 2025-05-07T19:48:32.9956892Z #define __UINT_LEAST32_FMTu__ "u" 2025-05-07T19:48:32.9956980Z #define __UINT_LEAST32_FMTx__ "x" 2025-05-07T19:48:32.9957078Z #define __UINT_LEAST32_MAX__ 4294967295U 2025-05-07T19:48:32.9957190Z #define __UINT_LEAST32_TYPE__ unsigned int 2025-05-07T19:48:32.9957285Z #define __UINT_LEAST64_FMTX__ "lX" 2025-05-07T19:48:32.9957376Z #define __UINT_LEAST64_FMTo__ "lo" 2025-05-07T19:48:32.9957481Z #define __UINT_LEAST64_FMTu__ "lu" 2025-05-07T19:48:32.9957570Z #define __UINT_LEAST64_FMTx__ "lx" 2025-05-07T19:48:32.9957696Z #define __UINT_LEAST64_MAX__ 18446744073709551615UL 2025-05-07T19:48:32.9957819Z #define __UINT_LEAST64_TYPE__ long unsigned int 2025-05-07T19:48:32.9957919Z #define __UINT_LEAST8_FMTX__ "hhX" 2025-05-07T19:48:32.9958008Z #define __UINT_LEAST8_FMTo__ "hho" 2025-05-07T19:48:32.9958100Z #define __UINT_LEAST8_FMTu__ "hhu" 2025-05-07T19:48:32.9958196Z #define __UINT_LEAST8_FMTx__ "hhx" 2025-05-07T19:48:32.9958287Z #define __UINT_LEAST8_MAX__ 255 2025-05-07T19:48:32.9958393Z #define __UINT_LEAST8_TYPE__ unsigned char 2025-05-07T19:48:32.9958490Z #define __ULONG32_TYPE unsigned int 2025-05-07T19:48:32.9958612Z #define __ULONGWORD_TYPE unsigned long int 2025-05-07T19:48:32.9958719Z #define __UQUAD_TYPE unsigned long int 2025-05-07T19:48:32.9958815Z #define __USECONDS_T_TYPE __U32_TYPE 2025-05-07T19:48:32.9958913Z #define __USER_LABEL_PREFIX__ 2025-05-07T19:48:32.9958996Z #define __USE_ANSI 1 2025-05-07T19:48:32.9959078Z #define __USE_ATFILE 1 2025-05-07T19:48:32.9959156Z #define __USE_BSD 1 2025-05-07T19:48:32.9959260Z #define __USE_FORTIFY_LEVEL 0 2025-05-07T19:48:32.9959345Z #define __USE_GNU 1 2025-05-07T19:48:32.9959424Z #define __USE_ISOC11 1 2025-05-07T19:48:32.9959520Z #define __USE_ISOC95 1 2025-05-07T19:48:32.9959599Z #define __USE_ISOC99 1 2025-05-07T19:48:32.9959684Z #define __USE_ISOCXX11 1 2025-05-07T19:48:32.9959769Z #define __USE_LARGEFILE 1 2025-05-07T19:48:32.9959869Z #define __USE_LARGEFILE64 1 2025-05-07T19:48:32.9959946Z #define __USE_MISC 1 2025-05-07T19:48:32.9960029Z #define __USE_POSIX 1 2025-05-07T19:48:32.9960129Z #define __USE_POSIX199309 1 2025-05-07T19:48:32.9960217Z #define __USE_POSIX199506 1 2025-05-07T19:48:32.9960301Z #define __USE_POSIX2 1 2025-05-07T19:48:32.9960382Z #define __USE_SVID 1 2025-05-07T19:48:32.9960476Z #define __USE_UNIX98 1 2025-05-07T19:48:32.9960562Z #define __USE_XOPEN 1 2025-05-07T19:48:32.9960646Z #define __USE_XOPEN2K 1 2025-05-07T19:48:32.9960735Z #define __USE_XOPEN2K8 1 2025-05-07T19:48:32.9960822Z #define __USE_XOPEN2K8XSI 1 2025-05-07T19:48:32.9960919Z #define __USE_XOPEN2KXSI 1 2025-05-07T19:48:32.9961012Z #define __USE_XOPEN_EXTENDED 1 2025-05-07T19:48:32.9961115Z #define __USING_NAMESPACE_C99(name) 2025-05-07T19:48:32.9961212Z #define __USING_NAMESPACE_STD(name) 2025-05-07T19:48:32.9961313Z #define __UWORD_TYPE unsigned long int 2025-05-07T19:48:32.9961415Z #define __VECTOR_FUNCTIONS_HPP__ 2025-05-07T19:48:32.9961508Z #define __VECTOR_FUNCTIONS_H__ 2025-05-07T19:48:32.9961595Z #define __VECTOR_TYPES_H__ 2025-05-07T19:48:32.9962052Z #define __VERSION__ "Clang 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:48:32.9962179Z #define __WAIT_INT(status) (*(int *) &(status)) 2025-05-07T19:48:32.9962272Z #define __WAIT_STATUS void * 2025-05-07T19:48:32.9962364Z #define __WAIT_STATUS_DEFN void * 2025-05-07T19:48:32.9962457Z #define __WALL 0x40000000 2025-05-07T19:48:32.9962545Z #define __WCHAR_MAX__ 2147483647 2025-05-07T19:48:32.9962636Z #define __WCHAR_TYPE__ int 2025-05-07T19:48:32.9962774Z #define __WCHAR_WIDTH__ 32 2025-05-07T19:48:32.9962934Z #define __WCLONE 0x80000000 2025-05-07T19:48:32.9963069Z #define __WCOREDUMP(status) ((status) & __WCOREFLAG) 2025-05-07T19:48:32.9963153Z #define __WCOREFLAG 0x80 2025-05-07T19:48:32.9963308Z #define __WEXITSTATUS(status) (((status) & 0xff00) >> 8) 2025-05-07T19:48:32.9963463Z #define __WIFCONTINUED(status) ((status) == __W_CONTINUED) 2025-05-07T19:48:32.9963599Z #define __WIFEXITED(status) (__WTERMSIG(status) == 0) 2025-05-07T19:48:32.9963831Z #define __WIFSIGNALED(status) (((signed char) (((status) & 0x7f) + 1) >> 1) > 0) 2025-05-07T19:48:32.9963978Z #define __WIFSTOPPED(status) (((status) & 0xff) == 0x7f) 2025-05-07T19:48:32.9964068Z #define __WINT_MAX__ 4294967295U 2025-05-07T19:48:32.9964167Z #define __WINT_TYPE__ unsigned int 2025-05-07T19:48:32.9964260Z #define __WINT_UNSIGNED__ 1 2025-05-07T19:48:32.9964344Z #define __WINT_WIDTH__ 32 2025-05-07T19:48:32.9964434Z #define __WNOTHREAD 0x20000000 2025-05-07T19:48:32.9964531Z #define __WORDSIZE 64 2025-05-07T19:48:32.9964631Z #define __WORDSIZE_TIME64_COMPAT32 1 2025-05-07T19:48:32.9964762Z #define __WSTOPSIG(status) __WEXITSTATUS(status) 2025-05-07T19:48:32.9964873Z #define __WTERMSIG(status) ((status) & 0x7f) 2025-05-07T19:48:32.9965088Z #define __W_CONTINUED 0xffff 2025-05-07T19:48:32.9965206Z #define __W_EXITCODE(ret,sig) ((ret) << 8 | (sig)) 2025-05-07T19:48:32.9965313Z #define __W_STOPCODE(sig) ((sig) << 8 | 0x7f) 2025-05-07T19:48:32.9965403Z #define ____FILE_defined 1 2025-05-07T19:48:32.9965486Z #define ____mbstate_t_defined 1 2025-05-07T19:48:32.9965602Z #define __align__(n) __attribute__((aligned(n))) 2025-05-07T19:48:32.9965774Z #define __always_inline __inline __attribute__ ((__always_inline__)) 2025-05-07T19:48:32.9965859Z #define __amd64 1 2025-05-07T19:48:32.9965930Z #define __amd64__ 1 2025-05-07T19:48:32.9966034Z #define __annotate__(a) __attribute__((a)) 2025-05-07T19:48:32.9966132Z #define __attribute_artificial__ 2025-05-07T19:48:32.9966273Z #define __attribute_const__ __attribute__ ((__const__)) 2025-05-07T19:48:32.9966450Z #define __attribute_deprecated__ __attribute__ ((__deprecated__)) 2025-05-07T19:48:32.9966650Z #define __attribute_format_arg__(x) __attribute__ ((__format_arg__ (x))) 2025-05-07T19:48:32.9966893Z #define __attribute_format_strfmon__(a,b) __attribute__ ((__format__ (__strfmon__, a, b))) 2025-05-07T19:48:32.9967033Z #define __attribute_malloc__ __attribute__ ((__malloc__)) 2025-05-07T19:48:32.9967186Z #define __attribute_noinline__ __attribute__ ((__noinline__)) 2025-05-07T19:48:32.9967325Z #define __attribute_pure__ __attribute__ ((__pure__)) 2025-05-07T19:48:32.9967449Z #define __attribute_used__ __attribute__ ((__used__)) 2025-05-07T19:48:32.9967668Z #define __attribute_warn_unused_result__ __attribute__ ((__warn_unused_result__)) 2025-05-07T19:48:32.9967759Z #define __blkcnt_t_defined 2025-05-07T19:48:32.9967846Z #define __blksize_t_defined 2025-05-07T19:48:32.9968026Z #define __bos(ptr) __builtin_object_size (ptr, __USE_FORTIFY_LEVEL > 1) 2025-05-07T19:48:32.9968153Z #define __bos0(ptr) __builtin_object_size (ptr, 0) 2025-05-07T19:48:32.9968233Z #define __bounded 2025-05-07T19:48:32.9968827Z #define __bswap_16(x) (__extension__ ({ unsigned short int __v, __x = (unsigned short int) (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_16 (__x); else __asm__ ("rorw $8, %w0" : "=r" (__v) : "0" (__x) : "cc"); __v; })) 2025-05-07T19:48:32.9969309Z #define __bswap_32(x) (__extension__ ({ unsigned int __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_32 (__x); else __asm__ ("bswap %0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:48:32.9969763Z #define __bswap_64(x) (__extension__ ({ __uint64_t __v, __x = (x); if (__builtin_constant_p (__x)) __v = __bswap_constant_64 (__x); else __asm__ ("bswap %q0" : "=r" (__v) : "0" (__x)); __v; })) 2025-05-07T19:48:32.9970013Z #define __bswap_constant_16(x) ((unsigned short int) ((((x) >> 8) & 0xff) | (((x) & 0xff) << 8))) 2025-05-07T19:48:32.9970386Z #define __bswap_constant_32(x) ((((x) & 0xff000000) >> 24) | (((x) & 0x00ff0000) >> 8) | (((x) & 0x0000ff00) << 8) | (((x) & 0x000000ff) << 24)) 2025-05-07T19:48:32.9971364Z #define __bswap_constant_64(x) (__extension__ ((((x) & 0xff00000000000000ull) >> 56) | (((x) & 0x00ff000000000000ull) >> 40) | (((x) & 0x0000ff0000000000ull) >> 24) | (((x) & 0x000000ff00000000ull) >> 8) | (((x) & 0x00000000ff000000ull) << 8) | (((x) & 0x0000000000ff0000ull) << 24) | (((x) & 0x000000000000ff00ull) << 40) | (((x) & 0x00000000000000ffull) << 56))) 2025-05-07T19:48:32.9971468Z #define __builtin_align__(a) __align__(a) 2025-05-07T19:48:32.9971560Z #define __catch(X) catch(X) 2025-05-07T19:48:32.9971632Z #define __cdecl 2025-05-07T19:48:32.9971706Z #define __clang__ 1 2025-05-07T19:48:32.9971808Z #define __clang_literal_encoding__ "UTF-8" 2025-05-07T19:48:32.9971894Z #define __clang_major__ 16 2025-05-07T19:48:32.9971974Z #define __clang_minor__ 0 2025-05-07T19:48:32.9972060Z #define __clang_patchlevel__ 6 2025-05-07T19:48:32.9972476Z #define __clang_version__ "16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4)" 2025-05-07T19:48:32.9972597Z #define __clang_wide_literal_encoding__ "UTF-32" 2025-05-07T19:48:32.9972684Z #define __clock_t_defined 1 2025-05-07T19:48:32.9972800Z #define __clockid_t_defined 1 2025-05-07T19:48:32.9973000Z #define __cluster_dims__(...) __attribute__((cluster_dims(__VA_ARGS__))) 2025-05-07T19:48:32.9973105Z #define __code_model_small__ 1 2025-05-07T19:48:32.9973223Z #define __constant__ __location__(constant) 2025-05-07T19:48:32.9973354Z #define __cplusplus 201703L 2025-05-07T19:48:32.9973466Z #define __cpp_aggregate_bases 201603L 2025-05-07T19:48:32.9973576Z #define __cpp_aggregate_nsdmi 201304L 2025-05-07T19:48:32.9973714Z #define __cpp_alias_templates 200704L 2025-05-07T19:48:32.9973823Z #define __cpp_aligned_new 201606L 2025-05-07T19:48:32.9973931Z #define __cpp_attributes 200809L 2025-05-07T19:48:32.9974041Z #define __cpp_binary_literals 201304L 2025-05-07T19:48:32.9974179Z #define __cpp_capture_star_this 201603L 2025-05-07T19:48:32.9974297Z #define __cpp_constexpr 201603L 2025-05-07T19:48:32.9974421Z #define __cpp_constexpr_in_decltype 201711L 2025-05-07T19:48:32.9974545Z #define __cpp_decltype 200707L 2025-05-07T19:48:32.9974655Z #define __cpp_decltype_auto 201304L 2025-05-07T19:48:32.9974769Z #define __cpp_deduction_guides 201703L 2025-05-07T19:48:32.9974900Z #define __cpp_delegating_constructors 200604L 2025-05-07T19:48:32.9975042Z #define __cpp_digit_separators 201309L 2025-05-07T19:48:32.9975169Z #define __cpp_enumerator_attributes 201411L 2025-05-07T19:48:32.9975280Z #define __cpp_exceptions 199711L 2025-05-07T19:48:32.9975417Z #define __cpp_fold_expressions 201603L 2025-05-07T19:48:32.9975529Z #define __cpp_generic_lambdas 201304L 2025-05-07T19:48:32.9975657Z #define __cpp_guaranteed_copy_elision 201606L 2025-05-07T19:48:32.9975787Z #define __cpp_hex_float 201603L 2025-05-07T19:48:32.9975899Z #define __cpp_if_constexpr 201606L 2025-05-07T19:48:32.9976007Z #define __cpp_impl_destroying_delete 201806L 2025-05-07T19:48:32.9976135Z #define __cpp_inheriting_constructors 201511L 2025-05-07T19:48:32.9976278Z #define __cpp_init_captures 201304L 2025-05-07T19:48:32.9976399Z #define __cpp_initializer_lists 200806L 2025-05-07T19:48:32.9976512Z #define __cpp_inline_variables 201606L 2025-05-07T19:48:32.9976648Z #define __cpp_lambdas 200907L 2025-05-07T19:48:32.9976778Z #define __cpp_lib_addressof_constexpr 201603 2025-05-07T19:48:32.9976897Z #define __cpp_lib_array_constexpr 201803L 2025-05-07T19:48:32.9977007Z #define __cpp_lib_as_const 201510 2025-05-07T19:48:32.9977147Z #define __cpp_lib_bool_constant 201505 2025-05-07T19:48:32.9977271Z #define __cpp_lib_exchange_function 201304 2025-05-07T19:48:32.9977442Z #define __cpp_lib_has_unique_object_representations 201606 2025-05-07T19:48:32.9977577Z #define __cpp_lib_hypot 201603 2025-05-07T19:48:32.9977693Z #define __cpp_lib_integer_sequence 201304 2025-05-07T19:48:32.9977827Z #define __cpp_lib_integral_constant_callable 201304 2025-05-07T19:48:32.9977994Z #define __cpp_lib_is_aggregate 201703 2025-05-07T19:48:32.9978180Z #define __cpp_lib_is_final 201402L 2025-05-07T19:48:32.9978291Z #define __cpp_lib_is_invocable 201703 2025-05-07T19:48:32.9978410Z #define __cpp_lib_is_null_pointer 201309 2025-05-07T19:48:32.9978553Z #define __cpp_lib_is_swappable 201603 2025-05-07T19:48:32.9978664Z #define __cpp_lib_launder 201606 2025-05-07T19:48:32.9978774Z #define __cpp_lib_logical_traits 201510 2025-05-07T19:48:32.9978904Z #define __cpp_lib_make_reverse_iterator 201402 2025-05-07T19:48:32.9979069Z #define __cpp_lib_math_special_functions 201603L 2025-05-07T19:48:32.9979167Z #define __cpp_lib_result_of_sfinae 201210 2025-05-07T19:48:32.9979308Z #define __cpp_lib_robust_nonmodifying_seq_ops 201304 2025-05-07T19:48:32.9979476Z #define __cpp_lib_transformation_trait_aliases 201304 2025-05-07T19:48:32.9979591Z #define __cpp_lib_tuple_element_t 201402L 2025-05-07T19:48:32.9979701Z #define __cpp_lib_tuples_by_type 201304 2025-05-07T19:48:32.9979877Z #define __cpp_lib_type_trait_variable_templates 201510L 2025-05-07T19:48:32.9979984Z #define __cpp_lib_void_t 201411 2025-05-07T19:48:32.9980107Z #define __cpp_named_character_escapes 202207L 2025-05-07T19:48:32.9980225Z #define __cpp_namespace_attributes 201411L 2025-05-07T19:48:32.9980383Z #define __cpp_nested_namespace_definitions 201411L 2025-05-07T19:48:32.9980503Z #define __cpp_noexcept_function_type 201510L 2025-05-07T19:48:32.9980615Z #define __cpp_nontype_template_args 201411L 2025-05-07T19:48:32.9980780Z #define __cpp_nontype_template_parameter_auto 201606L 2025-05-07T19:48:32.9980879Z #define __cpp_nsdmi 200809L 2025-05-07T19:48:32.9980986Z #define __cpp_range_based_for 201603L 2025-05-07T19:48:32.9981093Z #define __cpp_raw_strings 200710L 2025-05-07T19:48:32.9981195Z #define __cpp_ref_qualifiers 200710L 2025-05-07T19:48:32.9981297Z #define __cpp_return_type_deduction 201304L 2025-05-07T19:48:32.9981378Z #define __cpp_rtti 199711L 2025-05-07T19:48:32.9981483Z #define __cpp_rvalue_references 200610L 2025-05-07T19:48:32.9981574Z #define __cpp_static_assert 201411L 2025-05-07T19:48:32.9981678Z #define __cpp_static_call_operator 202207L 2025-05-07T19:48:32.9981788Z #define __cpp_structured_bindings 201606L 2025-05-07T19:48:32.9981887Z #define __cpp_template_auto 201606L 2025-05-07T19:48:32.9981997Z #define __cpp_threadsafe_static_init 200806L 2025-05-07T19:48:32.9982094Z #define __cpp_unicode_characters 200704L 2025-05-07T19:48:32.9982195Z #define __cpp_unicode_literals 200710L 2025-05-07T19:48:32.9982299Z #define __cpp_user_defined_literals 200809L 2025-05-07T19:48:32.9982394Z #define __cpp_variable_templates 201304L 2025-05-07T19:48:32.9982499Z #define __cpp_variadic_templates 200704L 2025-05-07T19:48:32.9982594Z #define __cpp_variadic_using 201611L 2025-05-07T19:48:32.9982694Z #define __cudaGet_blockDim() blockDim 2025-05-07T19:48:32.9982784Z #define __cudaGet_blockIdx() blockIdx 2025-05-07T19:48:32.9982878Z #define __cudaGet_gridDim() gridDim 2025-05-07T19:48:32.9982975Z #define __cudaGet_threadIdx() threadIdx 2025-05-07T19:48:32.9983073Z #define __cudaGet_warpSize() warpSize 2025-05-07T19:48:32.9983216Z #define __cudart_builtin__ __location__(cudart_builtin) 2025-05-07T19:48:32.9983298Z #define __daddr_t_defined 2025-05-07T19:48:32.9983377Z #define __dev_t_defined 2025-05-07T19:48:32.9983467Z #define __device__ __location__(device) 2025-05-07T19:48:32.9983606Z #define __device_builtin__ __location__(device_builtin) 2025-05-07T19:48:32.9983827Z #define __device_builtin_surface_type__ __location__(device_builtin_surface_type) 2025-05-07T19:48:32.9984048Z #define __device_builtin_texture_type__ __location__(device_builtin_texture_type) 2025-05-07T19:48:32.9984188Z #define __errordecl(name,msg) extern void name (void) 2025-05-07T19:48:32.9984276Z #define __export__ 2025-05-07T19:48:32.9984522Z #define __extern_always_inline extern __always_inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:48:32.9984715Z #define __extern_inline extern __inline __attribute__ ((__gnu_inline__)) 2025-05-07T19:48:32.9984790Z #define __flexarr [] 2025-05-07T19:48:32.9985013Z #define __forceinline__ __inline__ __attribute__((always_inline)) 2025-05-07T19:48:32.9985267Z #define __fortify_function __extern_always_inline __attribute_artificial__ 2025-05-07T19:48:32.9985358Z #define __fsblkcnt_t_defined 2025-05-07T19:48:32.9985441Z #define __fsfilcnt_t_defined 2025-05-07T19:48:32.9985520Z #define __gid_t_defined 2025-05-07T19:48:32.9985668Z #define __glibc_likely(cond) __builtin_expect((cond), 1) 2025-05-07T19:48:32.9985810Z #define __glibc_unlikely(cond) __builtin_expect((cond), 0) 2025-05-07T19:48:32.9986038Z #define __glibcxx_assert(cond) do { __glibcxx_constexpr_assert(cond); } while (false) 2025-05-07T19:48:32.9986144Z #define __glibcxx_class_requires(_a,_b) 2025-05-07T19:48:32.9986249Z #define __glibcxx_class_requires2(_a,_b,_c) 2025-05-07T19:48:32.9986360Z #define __glibcxx_class_requires3(_a,_b,_c,_d) 2025-05-07T19:48:32.9986478Z #define __glibcxx_class_requires4(_a,_b,_c,_d,_e) 2025-05-07T19:48:32.9986844Z #define __glibcxx_constexpr_assert(cond) if (__builtin_is_constant_evaluated() && !bool(cond)) __builtin_unreachable() 2025-05-07T19:48:32.9987033Z #define __glibcxx_digits10_b(T,B) (__glibcxx_digits_b (T,B) * 643L / 2136) 2025-05-07T19:48:32.9987189Z #define __glibcxx_digits_b(T,B) (B - __glibcxx_signed_b (T,B)) 2025-05-07T19:48:32.9987301Z #define __glibcxx_function_requires(...) 2025-05-07T19:48:32.9987399Z #define __glibcxx_integral_traps true 2025-05-07T19:48:32.9987696Z #define __glibcxx_max_b(T,B) (__glibcxx_signed_b (T,B) ? (((((T)1 << (__glibcxx_digits_b (T,B) - 1)) - 1) << 1) + 1) : ~(T)0) 2025-05-07T19:48:32.9987941Z #define __glibcxx_min_b(T,B) (__glibcxx_signed_b (T,B) ? -__glibcxx_max_b (T,B) - 1 : (T)0) 2025-05-07T19:48:32.9988130Z #define __glibcxx_requires_can_decrement_range(_First1,_Last1,_First2) 2025-05-07T19:48:32.9988269Z #define __glibcxx_requires_can_increment(_First,_Size) 2025-05-07T19:48:32.9988457Z #define __glibcxx_requires_can_increment_range(_First1,_Last1,_First2) 2025-05-07T19:48:32.9988584Z #define __glibcxx_requires_cond(_Cond,_Msg) 2025-05-07T19:48:32.9988697Z #define __glibcxx_requires_heap(_First,_Last) 2025-05-07T19:48:32.9988841Z #define __glibcxx_requires_heap_pred(_First,_Last,_Pred) 2025-05-07T19:48:32.9988979Z #define __glibcxx_requires_irreflexive(_First,_Last) 2025-05-07T19:48:32.9989114Z #define __glibcxx_requires_irreflexive2(_First,_Last) 2025-05-07T19:48:32.9989282Z #define __glibcxx_requires_irreflexive_pred(_First,_Last,_Pred) 2025-05-07T19:48:32.9989475Z #define __glibcxx_requires_irreflexive_pred2(_First,_Last,_Pred) 2025-05-07T19:48:32.9989620Z #define __glibcxx_requires_non_empty_range(_First,_Last) 2025-05-07T19:48:32.9989716Z #define __glibcxx_requires_nonempty() 2025-05-07T19:48:32.9989892Z #define __glibcxx_requires_partitioned_lower(_First,_Last,_Value) 2025-05-07T19:48:32.9990117Z #define __glibcxx_requires_partitioned_lower_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:48:32.9990291Z #define __glibcxx_requires_partitioned_upper(_First,_Last,_Value) 2025-05-07T19:48:32.9990514Z #define __glibcxx_requires_partitioned_upper_pred(_First,_Last,_Value,_Pred) 2025-05-07T19:48:32.9990648Z #define __glibcxx_requires_sorted(_First,_Last) 2025-05-07T19:48:32.9990800Z #define __glibcxx_requires_sorted_pred(_First,_Last,_Pred) 2025-05-07T19:48:32.9990958Z #define __glibcxx_requires_sorted_set(_First1,_Last1,_First2) 2025-05-07T19:48:32.9991167Z #define __glibcxx_requires_sorted_set_pred(_First1,_Last1,_First2,_Pred) 2025-05-07T19:48:32.9991276Z #define __glibcxx_requires_string(_String) 2025-05-07T19:48:32.9991403Z #define __glibcxx_requires_string_len(_String,_Len) 2025-05-07T19:48:32.9991506Z #define __glibcxx_requires_subscript(_N) 2025-05-07T19:48:32.9991650Z #define __glibcxx_requires_valid_range(_First,_Last) 2025-05-07T19:48:32.9991756Z #define __glibcxx_signed_b(T,B) ((T)(-1) < 0) 2025-05-07T19:48:32.9991851Z #define __global__ __location__(global) 2025-05-07T19:48:32.9991941Z #define __gnu_linux__ 1 2025-05-07T19:48:32.9992069Z #define __grid_constant__ __location__(grid_constant) 2025-05-07T19:48:32.9992470Z #define __have_pthread_attr_t 1 2025-05-07T19:48:32.9992657Z #define __host__ __location__(host) 2025-05-07T19:48:32.9992913Z #define __id_t_defined 2025-05-07T19:48:32.9992994Z #define __import__ 2025-05-07T19:48:32.9993080Z #define __ino64_t_defined 2025-05-07T19:48:32.9993175Z #define __ino_t_defined 2025-05-07T19:48:32.9993258Z #define __int8_t_defined 2025-05-07T19:48:32.9993496Z #define __intN_t(N,MODE) typedef int int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:48:32.9993706Z #define __isleap(year) ((year) % 4 == 0 && ((year) % 100 != 0 || (year) % 400 == 0)) 2025-05-07T19:48:32.9993791Z #define __k8 1 2025-05-07T19:48:32.9994060Z #define __k8__ 1 2025-05-07T19:48:32.9994142Z #define __key_t_defined 2025-05-07T19:48:32.9994358Z #define __launch_bounds__(...) __annotate__(launch_bounds(__VA_ARGS__)) 2025-05-07T19:48:32.9994447Z #define __ldiv_t_defined 1 2025-05-07T19:48:32.9994523Z #define __linux 1 2025-05-07T19:48:32.9994609Z #define __linux__ 1 2025-05-07T19:48:32.9994698Z #define __lldiv_t_defined 1 2025-05-07T19:48:32.9994782Z #define __llvm__ 1 2025-05-07T19:48:32.9994885Z #define __location__(a) __annotate__(a) 2025-05-07T19:48:32.9994994Z #define __long_double_t long double 2025-05-07T19:48:32.9995087Z #define __malloc_and_calloc_defined 2025-05-07T19:48:32.9995194Z #define __managed__ __location__(managed) 2025-05-07T19:48:32.9995292Z #define __mode_t_defined 2025-05-07T19:48:32.9995371Z #define __need_IOV_MAX 2025-05-07T19:48:32.9995454Z #define __need_clock_t 2025-05-07T19:48:32.9995546Z #define __need_clockid_t 2025-05-07T19:48:32.9995644Z #define __need_time_t 2025-05-07T19:48:32.9995725Z #define __need_timer_t 2025-05-07T19:48:32.9995811Z #define __need_timespec 2025-05-07T19:48:32.9995903Z #define __nlink_t_defined 2025-05-07T19:48:32.9996027Z #define __no_return__ __attribute__((noreturn)) 2025-05-07T19:48:32.9996148Z #define __noinline__ __attribute__((noinline)) 2025-05-07T19:48:32.9996320Z #define __nonnull(params) __attribute__ ((__nonnull__ params)) 2025-05-07T19:48:32.9996415Z #define __off64_t_defined 2025-05-07T19:48:32.9996499Z #define __off_t_defined 2025-05-07T19:48:32.9996582Z #define __pic__ 2 2025-05-07T19:48:32.9996672Z #define __pid_t_defined 2025-05-07T19:48:32.9996751Z #define __pie__ 2 2025-05-07T19:48:32.9996850Z #define __private_extern__ extern 2025-05-07T19:48:32.9996933Z #define __ptr_t void * 2025-05-07T19:48:32.9997020Z #define __ptrvalue 2025-05-07T19:48:32.9997100Z #define __restrict_arr 2025-05-07T19:48:32.9997236Z #define __seg_fs __attribute__((address_space(257))) 2025-05-07T19:48:32.9997377Z #define __seg_gs __attribute__((address_space(256))) 2025-05-07T19:48:32.9997479Z #define __shared__ __location__(shared) 2025-05-07T19:48:32.9997568Z #define __sigset_t_defined 2025-05-07T19:48:32.9997670Z #define __specialization_static 2025-05-07T19:48:32.9997764Z #define __ssize_t_defined 2025-05-07T19:48:32.9997848Z #define __stub_bdflush 2025-05-07T19:48:32.9997930Z #define __stub_chflags 2025-05-07T19:48:32.9998026Z #define __stub_fattach 2025-05-07T19:48:32.9998117Z #define __stub_fchflags 2025-05-07T19:48:32.9998202Z #define __stub_fdetach 2025-05-07T19:48:32.9998283Z #define __stub_getmsg 2025-05-07T19:48:32.9998371Z #define __stub_gtty 2025-05-07T19:48:32.9998453Z #define __stub_lchmod 2025-05-07T19:48:32.9998533Z #define __stub_putmsg 2025-05-07T19:48:32.9998633Z #define __stub_revoke 2025-05-07T19:48:32.9998721Z #define __stub_setlogin 2025-05-07T19:48:32.9998806Z #define __stub_sigreturn 2025-05-07T19:48:32.9998886Z #define __stub_sstk 2025-05-07T19:48:32.9998974Z #define __stub_stty 2025-05-07T19:48:32.9999065Z #define __suseconds_t_defined 2025-05-07T19:48:32.9999150Z #define __thread__ __thread 2025-05-07T19:48:32.9999259Z #define __throw_exception_again throw 2025-05-07T19:48:32.9999346Z #define __time_t_defined 1 2025-05-07T19:48:32.9999441Z #define __timer_t_defined 1 2025-05-07T19:48:32.9999532Z #define __timespec_defined 1 2025-05-07T19:48:32.9999624Z #define __try try 2025-05-07T19:48:32.9999763Z #define __tune_k8__ 1 2025-05-07T19:48:32.9999901Z #define __u_char_defined 2025-05-07T19:48:33.0000190Z #define __u_intN_t(N,MODE) typedef unsigned int u_int##N##_t __attribute__ ((__mode__ (MODE))) 2025-05-07T19:48:33.0000280Z #define __uid_t_defined 2025-05-07T19:48:33.0000359Z #define __unbounded 2025-05-07T19:48:33.0000434Z #define __unix 1 2025-05-07T19:48:33.0000514Z #define __unix__ 1 2025-05-07T19:48:33.0000611Z #define __useconds_t_defined 2025-05-07T19:48:33.0000695Z #define __warnattr(msg) 2025-05-07T19:48:33.0000835Z #define __warndecl(name,msg) extern void name (void) 2025-05-07T19:48:33.0000913Z #define __wur 2025-05-07T19:48:33.0000996Z #define __x86_64 1 2025-05-07T19:48:33.0001080Z #define __x86_64__ 1 2025-05-07T19:48:33.0001205Z #define alloca(size) __builtin_alloca (size) 2025-05-07T19:48:33.0001565Z #define assert(expr) ((expr) ? __ASSERT_VOID_CAST (0) : __assert_fail (__STRING(expr), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:48:33.0001985Z #define assert_perror(errnum) (!(errnum) ? __ASSERT_VOID_CAST (0) : __assert_perror_fail ((errnum), __FILE__, __LINE__, __ASSERT_FUNCTION)) 2025-05-07T19:48:33.0002099Z #define be16toh(x) __bswap_16 (x) 2025-05-07T19:48:33.0002192Z #define be32toh(x) __bswap_32 (x) 2025-05-07T19:48:33.0002287Z #define be64toh(x) __bswap_64 (x) 2025-05-07T19:48:33.0002408Z #define cudaArrayColorAttachment 0x20 2025-05-07T19:48:33.0002507Z #define cudaArrayCubemap 0x04 2025-05-07T19:48:33.0002600Z #define cudaArrayDefault 0x00 2025-05-07T19:48:33.0002709Z #define cudaArrayDeferredMapping 0x80 2025-05-07T19:48:33.0002816Z #define cudaArrayLayered 0x01 2025-05-07T19:48:33.0002913Z #define cudaArraySparse 0x40 2025-05-07T19:48:33.0003072Z #define cudaArraySparsePropertiesSingleMipTail 0x1 2025-05-07T19:48:33.0003195Z #define cudaArraySurfaceLoadStore 0x02 2025-05-07T19:48:33.0003299Z #define cudaArrayTextureGather 0x08 2025-05-07T19:48:33.0003488Z #define cudaCooperativeLaunchMultiDeviceNoPostSync 0x02 2025-05-07T19:48:33.0003664Z #define cudaCooperativeLaunchMultiDeviceNoPreSync 0x01 2025-05-07T19:48:33.0003787Z #define cudaCpuDeviceId ((int)-1) 2025-05-07T19:48:33.0003892Z #define cudaDeviceBlockingSync 0x04 2025-05-07T19:48:33.0004000Z #define cudaDeviceLmemResizeToMax 0x10 2025-05-07T19:48:33.0004107Z #define cudaDeviceMapHost 0x08 2025-05-07T19:48:33.0004196Z #define cudaDeviceMask 0x1f 2025-05-07T19:48:33.0004723Z #define cudaDevicePropDontCare { {'\0'}, {{0}}, {'\0'}, 0, 0, 0, 0, 0, 0, 0, {0, 0, 0}, {0, 0, 0}, 0, 0, -1, -1, 0, 0, -1, 0, 0, 0, 0, 0, 0, 0, 0, {0, 0}, {0, 0}, {0, 0, 0}, {0, 0}, {0, 0, 0}, {0, 0, 0}, 0, {0, 0}, {0, 0, 0}, {0, 0}, 0, {0, 0}, {0, 0, 0}, {0, 0}, {0, 0, 0}, 0, {0, 0}, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, } 2025-05-07T19:48:33.0004837Z #define cudaDeviceScheduleAuto 0x00 2025-05-07T19:48:33.0005077Z #define cudaDeviceScheduleBlockingSync 0x04 2025-05-07T19:48:33.0005171Z #define cudaDeviceScheduleMask 0x07 2025-05-07T19:48:33.0005282Z #define cudaDeviceScheduleSpin 0x01 2025-05-07T19:48:33.0005382Z #define cudaDeviceScheduleYield 0x02 2025-05-07T19:48:33.0005478Z #define cudaEventBlockingSync 0x01 2025-05-07T19:48:33.0005564Z #define cudaEventDefault 0x00 2025-05-07T19:48:33.0005670Z #define cudaEventDisableTiming 0x02 2025-05-07T19:48:33.0005763Z #define cudaEventInterprocess 0x04 2025-05-07T19:48:33.0005855Z #define cudaEventRecordDefault 0x00 2025-05-07T19:48:33.0005958Z #define cudaEventRecordExternal 0x01 2025-05-07T19:48:33.0006049Z #define cudaEventWaitDefault 0x00 2025-05-07T19:48:33.0006144Z #define cudaEventWaitExternal 0x01 2025-05-07T19:48:33.0006250Z #define cudaExternalMemoryDedicated 0x1 2025-05-07T19:48:33.0006452Z #define cudaExternalSemaphoreSignalSkipNvSciBufMemSync 0x01 2025-05-07T19:48:33.0006624Z #define cudaExternalSemaphoreWaitSkipNvSciBufMemSync 0x02 2025-05-07T19:48:33.0006722Z #define cudaHostAllocDefault 0x00 2025-05-07T19:48:33.0006829Z #define cudaHostAllocMapped 0x02 2025-05-07T19:48:33.0020251Z #define cudaHostAllocPortable 0x01 2025-05-07T19:48:33.0020583Z #define cudaHostAllocWriteCombined 0x04 2025-05-07T19:48:33.0020758Z #define cudaHostRegisterDefault 0x00 2025-05-07T19:48:33.0020875Z #define cudaHostRegisterIoMemory 0x04 2025-05-07T19:48:33.0020987Z #define cudaHostRegisterMapped 0x02 2025-05-07T19:48:33.0021092Z #define cudaHostRegisterPortable 0x01 2025-05-07T19:48:33.0021205Z #define cudaHostRegisterReadOnly 0x08 2025-05-07T19:48:33.0021306Z #define cudaInvalidDeviceId ((int)-2) 2025-05-07T19:48:33.0021430Z #define cudaIpcMemLazyEnablePeerAccess 0x01 2025-05-07T19:48:33.0021571Z #define cudaKernelNodeAttrID cudaLaunchAttributeID 2025-05-07T19:48:33.0021739Z #define cudaKernelNodeAttrValue cudaLaunchAttributeValue 2025-05-07T19:48:33.0022057Z #define cudaKernelNodeAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:48:33.0022347Z #define cudaKernelNodeAttributeClusterDimension cudaLaunchAttributeClusterDimension 2025-05-07T19:48:33.0022840Z #define cudaKernelNodeAttributeClusterSchedulingPolicyPreference cudaLaunchAttributeClusterSchedulingPolicyPreference 2025-05-07T19:48:33.0023089Z #define cudaKernelNodeAttributeCooperative cudaLaunchAttributeCooperative 2025-05-07T19:48:33.0023303Z #define cudaKernelNodeAttributePriority cudaLaunchAttributePriority 2025-05-07T19:48:33.0023418Z #define cudaMemAttachGlobal 0x01 2025-05-07T19:48:33.0023515Z #define cudaMemAttachHost 0x02 2025-05-07T19:48:33.0023610Z #define cudaMemAttachSingle 0x04 2025-05-07T19:48:33.0023707Z #define cudaNvSciSyncAttrSignal 0x1 2025-05-07T19:48:33.0023827Z #define cudaNvSciSyncAttrWait 0x2 2025-05-07T19:48:33.0023922Z #define cudaOccupancyDefault 0x00 2025-05-07T19:48:33.0024061Z #define cudaOccupancyDisableCachingOverride 0x01 2025-05-07T19:48:33.0024169Z #define cudaPeerAccessDefault 0x00 2025-05-07T19:48:33.0024508Z #define cudaSignalExternalSemaphoresAsync __CUDART_API_PTSZ(cudaSignalExternalSemaphoresAsync_v2) 2025-05-07T19:48:33.0024629Z #define cudaStreamAttrID cudaLaunchAttributeID 2025-05-07T19:48:33.0024783Z #define cudaStreamAttrValue cudaLaunchAttributeValue 2025-05-07T19:48:33.0025081Z #define cudaStreamAttributeAccessPolicyWindow cudaLaunchAttributeAccessPolicyWindow 2025-05-07T19:48:33.0025403Z #define cudaStreamAttributeSynchronizationPolicy cudaLaunchAttributeSynchronizationPolicy 2025-05-07T19:48:33.0025497Z #define cudaStreamDefault 0x00 2025-05-07T19:48:33.0025619Z #define cudaStreamLegacy ((cudaStream_t)0x1) 2025-05-07T19:48:33.0025715Z #define cudaStreamNonBlocking 0x01 2025-05-07T19:48:33.0025836Z #define cudaStreamPerThread ((cudaStream_t)0x2) 2025-05-07T19:48:33.0025937Z #define cudaSurfaceType1D 0x01 2025-05-07T19:48:33.0026044Z #define cudaSurfaceType1DLayered 0xF1 2025-05-07T19:48:33.0026137Z #define cudaSurfaceType2D 0x02 2025-05-07T19:48:33.0026239Z #define cudaSurfaceType2DLayered 0xF2 2025-05-07T19:48:33.0026338Z #define cudaSurfaceType3D 0x03 2025-05-07T19:48:33.0026438Z #define cudaSurfaceTypeCubemap 0x0C 2025-05-07T19:48:33.0026554Z #define cudaSurfaceTypeCubemapLayered 0xFC 2025-05-07T19:48:33.0026656Z #define cudaTextureType1D 0x01 2025-05-07T19:48:33.0026758Z #define cudaTextureType1DLayered 0xF1 2025-05-07T19:48:33.0026847Z #define cudaTextureType2D 0x02 2025-05-07T19:48:33.0026944Z #define cudaTextureType2DLayered 0xF2 2025-05-07T19:48:33.0027040Z #define cudaTextureType3D 0x03 2025-05-07T19:48:33.0027139Z #define cudaTextureTypeCubemap 0x0C 2025-05-07T19:48:33.0027253Z #define cudaTextureTypeCubemapLayered 0xFC 2025-05-07T19:48:33.0027578Z #define cudaWaitExternalSemaphoresAsync __CUDART_API_PTSZ(cudaWaitExternalSemaphoresAsync_v2) 2025-05-07T19:48:33.0027669Z #define getc(_fp) _IO_getc (_fp) 2025-05-07T19:48:33.0027755Z #define htobe16(x) __bswap_16 (x) 2025-05-07T19:48:33.0027849Z #define htobe32(x) __bswap_32 (x) 2025-05-07T19:48:33.0027932Z #define htobe64(x) __bswap_64 (x) 2025-05-07T19:48:33.0028009Z #define htole16(x) (x) 2025-05-07T19:48:33.0028084Z #define htole32(x) (x) 2025-05-07T19:48:33.0028169Z #define htole64(x) (x) 2025-05-07T19:48:33.0028245Z #define le16toh(x) (x) 2025-05-07T19:48:33.0028371Z #define le32toh(x) (x) 2025-05-07T19:48:33.0028518Z #define le64toh(x) (x) 2025-05-07T19:48:33.0028592Z #define linux 1 2025-05-07T19:48:33.0028688Z #define major(dev) gnu_dev_major (dev) 2025-05-07T19:48:33.0028808Z #define makedev(maj,min) gnu_dev_makedev (maj, min) 2025-05-07T19:48:33.0028952Z #define math_errhandling (MATH_ERRNO | MATH_ERREXCEPT) 2025-05-07T19:48:33.0029046Z #define minor(dev) gnu_dev_minor (dev) 2025-05-07T19:48:33.0029164Z #define offsetof(t,d) __builtin_offsetof(t, d) 2025-05-07T19:48:33.0029277Z #define putc(_ch,_fp) _IO_putc (_ch, _fp) 2025-05-07T19:48:33.0029356Z #define stderr stderr 2025-05-07T19:48:33.0029438Z #define stdin stdin 2025-05-07T19:48:33.0029524Z #define stdout stdout 2025-05-07T19:48:33.0030021Z #define strdupa(s) (__extension__ ({ const char *__old = (s); size_t __len = strlen (__old) + 1; char *__new = (char *) __builtin_alloca (__len); (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:48:33.0030555Z #define strndupa(s,n) (__extension__ ({ const char *__old = (s); size_t __len = strnlen (__old, (n)); char *__new = (char *) __builtin_alloca (__len + 1); __new[__len] = '\0'; (char *) memcpy (__new, __old, __len); })) 2025-05-07T19:48:33.0030636Z #define unix 1 2025-05-07T19:48:33.0030769Z #define w_coredump __wait_terminated.__w_coredump 2025-05-07T19:48:33.0030885Z #define w_retcode __wait_terminated.__w_retcode 2025-05-07T19:48:33.0030997Z #define w_stopsig __wait_stopped.__w_stopsig 2025-05-07T19:48:33.0031110Z #define w_stopval __wait_stopped.__w_stopval 2025-05-07T19:48:33.0031413Z #define w_termsig __wait_terminated.__w_termsig 2025-05-07T19:48:33.0031421Z 2025-05-07T19:48:33.0146554Z 2025-05-07T19:48:33.0147738Z + conda run -n build_binary nvcc --version 2025-05-07T19:48:33.0147760Z 2025-05-07T19:48:34.8188739Z nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:48:34.8189762Z Copyright (c) 2005-2022 NVIDIA Corporation 2025-05-07T19:48:34.8190686Z Built on Wed_Sep_21_10:33:58_PDT_2022 2025-05-07T19:48:34.8191676Z Cuda compilation tools, release 11.8, V11.8.89 2025-05-07T19:48:34.8192892Z Build cuda_11.8.r11.8/compiler.31833905_0 2025-05-07T19:48:34.8193532Z 2025-05-07T19:48:34.8759999Z 2025-05-07T19:48:34.8767653Z [CHECK] nvidia-smi not found 2025-05-07T19:48:34.8769772Z which: no nvidia-smi in (CONDA=/github/home/miniconda:/github/home/miniconda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin) 2025-05-07T19:48:34.8770922Z [INSTALL] Successfully installed CUDA 11.8.0 2025-05-07T19:48:34.8864405Z ##[group]Run . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/11.8.0 2025-05-07T19:48:34.8865032Z . $PRELUDE; install_pytorch_pip $BUILD_ENV nightly cuda/11.8.0 2025-05-07T19:48:34.8865660Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:48:34.8866009Z env: 2025-05-07T19:48:34.8866254Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:48:34.8866570Z BUILD_ENV: build_binary 2025-05-07T19:48:34.8866848Z BUILD_TARGET: default 2025-05-07T19:48:34.8867087Z BUILD_VARIANT: cuda 2025-05-07T19:48:34.8867347Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:48:34.8867624Z ##[endgroup] 2025-05-07T19:48:35.3403594Z ################################################################################ 2025-05-07T19:48:35.3404020Z # Install PyTorch (PIP) 2025-05-07T19:48:35.3404267Z # 2025-05-07T19:48:35.3419023Z # [2025-05-07T19:48:35.341Z] + install_pytorch_pip build_binary nightly cuda/11.8.0 2025-05-07T19:48:35.3419602Z ################################################################################ 2025-05-07T19:48:35.3419881Z 2025-05-07T19:48:35.3442157Z [EXEC] [ATTEMPT 0/3] + conda install -n build_binary -c conda-forge --override-channels -y numpy 2025-05-07T19:48:36.2154419Z Channels: 2025-05-07T19:48:36.2155132Z - conda-forge 2025-05-07T19:48:36.2155774Z Platform: linux-64 2025-05-07T19:48:45.8063108Z Collecting package metadata (repodata.json): - \ | / - \ | / - \ | / - \ | / done 2025-05-07T19:48:47.3420589Z Solving environment: \ | / done 2025-05-07T19:48:47.5190361Z 2025-05-07T19:48:47.5192034Z ## Package Plan ## 2025-05-07T19:48:47.5192624Z 2025-05-07T19:48:47.5193029Z environment location: /github/home/miniconda/envs/build_binary 2025-05-07T19:48:47.5193386Z 2025-05-07T19:48:47.5193502Z added / updated specs: 2025-05-07T19:48:47.5193801Z - numpy 2025-05-07T19:48:47.5193936Z 2025-05-07T19:48:47.5193940Z 2025-05-07T19:48:47.5194074Z The following packages will be downloaded: 2025-05-07T19:48:47.5194378Z 2025-05-07T19:48:47.5194508Z package | build 2025-05-07T19:48:47.5194867Z ---------------------------|----------------- 2025-05-07T19:48:47.5195319Z libblas-3.9.0 |31_h59b9bed_openblas 16 KB conda-forge 2025-05-07T19:48:47.5195852Z libcblas-3.9.0 |31_he106b2a_openblas 16 KB conda-forge 2025-05-07T19:48:47.5196368Z liblapack-3.9.0 |31_h7ac8fdf_openblas 16 KB conda-forge 2025-05-07T19:48:47.5196973Z numpy-2.2.5 | py310hefbff90_0 7.6 MB conda-forge 2025-05-07T19:48:47.5197387Z ------------------------------------------------------------ 2025-05-07T19:48:47.5197770Z Total: 7.6 MB 2025-05-07T19:48:47.5197988Z 2025-05-07T19:48:47.5198122Z The following NEW packages will be INSTALLED: 2025-05-07T19:48:47.5198378Z 2025-05-07T19:48:47.5198632Z libblas conda-forge/linux-64::libblas-3.9.0-31_h59b9bed_openblas 2025-05-07T19:48:47.5199185Z libcblas conda-forge/linux-64::libcblas-3.9.0-31_he106b2a_openblas 2025-05-07T19:48:47.5199723Z liblapack conda-forge/linux-64::liblapack-3.9.0-31_h7ac8fdf_openblas 2025-05-07T19:48:47.5200272Z numpy conda-forge/linux-64::numpy-2.2.5-py310hefbff90_0 2025-05-07T19:48:47.5200551Z 2025-05-07T19:48:47.5200555Z 2025-05-07T19:48:47.5200559Z 2025-05-07T19:48:47.5200730Z Downloading and Extracting Packages: ...working... 2025-05-07T19:48:47.5215808Z numpy-2.2.5 | 7.6 MB | | 0% 2025-05-07T19:48:47.5216689Z 2025-05-07T19:48:47.5217818Z libblas-3.9.0 | 16 KB | | 0%  2025-05-07T19:48:47.5218458Z 2025-05-07T19:48:47.5218469Z 2025-05-07T19:48:47.5222759Z libcblas-3.9.0 | 16 KB | | 0%  2025-05-07T19:48:47.5223034Z 2025-05-07T19:48:47.5223046Z 2025-05-07T19:48:47.5223050Z 2025-05-07T19:48:47.5811796Z liblapack-3.9.0 | 16 KB | | 0%  2025-05-07T19:48:47.5812288Z 2025-05-07T19:48:47.5812292Z 2025-05-07T19:48:47.5913411Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:48:47.5913734Z 2025-05-07T19:48:47.5913738Z 2025-05-07T19:48:47.5958732Z libcblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:48:47.5959582Z 2025-05-07T19:48:47.5959608Z 2025-05-07T19:48:47.5959619Z 2025-05-07T19:48:47.6040642Z liblapack-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:48:47.6040968Z 2025-05-07T19:48:47.6040973Z 2025-05-07T19:48:47.6040977Z 2025-05-07T19:48:47.6194376Z liblapack-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:48:47.6417811Z numpy-2.2.5 | 7.6 MB | #####1 | 51% 2025-05-07T19:48:47.6418600Z 2025-05-07T19:48:47.6419477Z libblas-3.9.0 | 16 KB | #########7 | 97%  2025-05-07T19:48:47.6420220Z 2025-05-07T19:48:47.6500091Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:48:47.6500953Z 2025-05-07T19:48:47.6779705Z libblas-3.9.0 | 16 KB | ########## | 100%  2025-05-07T19:48:48.0162506Z numpy-2.2.5 | 7.6 MB | ########## | 100% 2025-05-07T19:48:48.0163903Z numpy-2.2.5 | 7.6 MB | ########## | 100% 2025-05-07T19:48:48.0164920Z 2025-05-07T19:48:48.0165580Z 2025-05-07T19:48:48.0166229Z  2025-05-07T19:48:48.0166866Z 2025-05-07T19:48:48.0166878Z 2025-05-07T19:48:48.0167406Z  2025-05-07T19:48:48.0168513Z 2025-05-07T19:48:48.0168524Z 2025-05-07T19:48:48.0168536Z 2025-05-07T19:48:48.0169090Z  done 2025-05-07T19:48:48.1176526Z Preparing transaction: \ done 2025-05-07T19:48:48.2182480Z Verifying transaction: / done 2025-05-07T19:48:48.3191373Z Executing transaction: \ done 2025-05-07T19:48:48.4199154Z ################################################################################ 2025-05-07T19:48:48.4200186Z # Install Package From PyTorch PIP: torch 2025-05-07T19:48:48.4200536Z # 2025-05-07T19:48:48.4220715Z # [2025-05-07T19:48:48.421Z] + install_from_pytorch_pip build_binary torch nightly cuda/11.8.0 2025-05-07T19:48:48.4221279Z ################################################################################ 2025-05-07T19:48:48.4221537Z 2025-05-07T19:48:48.4246347Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:48:48.5115140Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:48:48.5116308Z ################################################################################ 2025-05-07T19:48:48.5117342Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:48:48.5118162Z # 2025-05-07T19:48:48.5132112Z # [2025-05-07T19:48:48.512Z] + __prepare_pip_arguments torch nightly cuda/11.8.0 2025-05-07T19:48:48.5132645Z ################################################################################ 2025-05-07T19:48:48.5132921Z 2025-05-07T19:48:48.5158588Z [INSTALL] Extracted package (channel, version): (nightly, LATEST) 2025-05-07T19:48:48.5186803Z [INSTALL] Extracted package variant: cu118 2025-05-07T19:48:48.5197281Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:48:48.5197889Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/cu118/ 2025-05-07T19:48:48.5205096Z [INSTALL] Extracted the full PIP package: --pre torch 2025-05-07T19:48:48.5216846Z [INSTALL] Attempting to install [torch, LATEST] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/cu118/ ... 2025-05-07T19:48:48.5238812Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre torch --index-url https://download.pytorch.org/whl/nightly/cu118/ 2025-05-07T19:50:08.6290833Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:50:08.6292498Z 2025-05-07T19:50:08.6292752Z Looking in indexes: https://download.pytorch.org/whl/nightly/cu118/ 2025-05-07T19:50:08.6293178Z Collecting torch 2025-05-07T19:50:08.6293903Z Downloading https://download.pytorch.org/whl/nightly/cu118/torch-2.8.0.dev20250507%2Bcu118-cp310-cp310-manylinux_2_28_x86_64.whl.metadata (29 kB) 2025-05-07T19:50:08.6294683Z Collecting filelock (from torch) 2025-05-07T19:50:08.6295267Z Downloading https://download.pytorch.org/whl/nightly/filelock-3.16.1-py3-none-any.whl (16 kB) 2025-05-07T19:50:08.6296282Z Requirement already satisfied: typing-extensions>=4.10.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from torch) (4.13.2) 2025-05-07T19:50:08.6297078Z Collecting sympy>=1.13.3 (from torch) 2025-05-07T19:50:08.6297635Z Downloading https://download.pytorch.org/whl/nightly/sympy-1.13.3-py3-none-any.whl (6.2 MB) 2025-05-07T19:50:08.6298599Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 6.2/6.2 MB 224.7 MB/s eta 0:00:00 2025-05-07T19:50:08.6298993Z Collecting networkx (from torch) 2025-05-07T19:50:08.6299526Z Downloading https://download.pytorch.org/whl/nightly/networkx-3.4.2-py3-none-any.whl (1.7 MB) 2025-05-07T19:50:08.6300244Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 1.7/1.7 MB 156.7 MB/s eta 0:00:00 2025-05-07T19:50:08.6301011Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from torch) (3.1.6) 2025-05-07T19:50:08.6301913Z Collecting fsspec (from torch) 2025-05-07T19:50:08.6302457Z Downloading https://download.pytorch.org/whl/nightly/fsspec-2024.10.0-py3-none-any.whl (179 kB) 2025-05-07T19:50:08.6303121Z Collecting nvidia-cuda-nvrtc-cu11==11.8.89 (from torch) 2025-05-07T19:50:08.6304000Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cuda_nvrtc_cu11-11.8.89-py3-none-manylinux1_x86_64.whl (23.2 MB) 2025-05-07T19:50:08.6304858Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 23.2/23.2 MB 219.3 MB/s eta 0:00:00 2025-05-07T19:50:08.6305288Z Collecting nvidia-cuda-runtime-cu11==11.8.89 (from torch) 2025-05-07T19:50:08.6306065Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cuda_runtime_cu11-11.8.89-py3-none-manylinux1_x86_64.whl (875 kB) 2025-05-07T19:50:08.6306984Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 875.6/875.6 kB 101.7 MB/s eta 0:00:00 2025-05-07T19:50:08.6307404Z Collecting nvidia-cuda-cupti-cu11==11.8.87 (from torch) 2025-05-07T19:50:08.6308138Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cuda_cupti_cu11-11.8.87-py3-none-manylinux1_x86_64.whl (13.1 MB) 2025-05-07T19:50:08.6308921Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 13.1/13.1 MB 190.5 MB/s eta 0:00:00 2025-05-07T19:50:08.6309320Z Collecting nvidia-cudnn-cu11==9.1.0.70 (from torch) 2025-05-07T19:50:08.6310024Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cudnn_cu11-9.1.0.70-py3-none-manylinux2014_x86_64.whl (663.9 MB) 2025-05-07T19:50:08.6310829Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 663.9/663.9 MB 50.3 MB/s eta 0:00:00 2025-05-07T19:50:08.6311240Z Collecting nvidia-cublas-cu11==11.11.3.6 (from torch) 2025-05-07T19:50:08.6311938Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cublas_cu11-11.11.3.6-py3-none-manylinux1_x86_64.whl (417.9 MB) 2025-05-07T19:50:08.6313037Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 417.9/417.9 MB 80.2 MB/s eta 0:00:00 2025-05-07T19:50:08.6313457Z Collecting nvidia-cufft-cu11==10.9.0.58 (from torch) 2025-05-07T19:50:08.6314304Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cufft_cu11-10.9.0.58-py3-none-manylinux1_x86_64.whl (168.4 MB) 2025-05-07T19:50:08.6315157Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 168.4/168.4 MB 160.2 MB/s eta 0:00:00 2025-05-07T19:50:08.6315576Z Collecting nvidia-curand-cu11==10.3.0.86 (from torch) 2025-05-07T19:50:08.6316474Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_curand_cu11-10.3.0.86-py3-none-manylinux1_x86_64.whl (58.1 MB) 2025-05-07T19:50:08.6317310Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 58.1/58.1 MB 164.6 MB/s eta 0:00:00 2025-05-07T19:50:08.6317761Z Collecting nvidia-cusolver-cu11==11.4.1.48 (from torch) 2025-05-07T19:50:08.6318528Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cusolver_cu11-11.4.1.48-py3-none-manylinux1_x86_64.whl (128.2 MB) 2025-05-07T19:50:08.6319497Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 128.2/128.2 MB 138.6 MB/s eta 0:00:00 2025-05-07T19:50:08.6319914Z Collecting nvidia-cusparse-cu11==11.7.5.86 (from torch) 2025-05-07T19:50:08.6320627Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_cusparse_cu11-11.7.5.86-py3-none-manylinux1_x86_64.whl (204.1 MB) 2025-05-07T19:50:08.6321429Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 204.1/204.1 MB 156.2 MB/s eta 0:00:00 2025-05-07T19:50:08.6321793Z Collecting nvidia-nccl-cu11==2.21.5 (from torch) 2025-05-07T19:50:08.6322480Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_nccl_cu11-2.21.5-py3-none-manylinux2014_x86_64.whl (147.8 MB) 2025-05-07T19:50:08.6323259Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 147.8/147.8 MB 190.2 MB/s eta 0:00:00 2025-05-07T19:50:08.6323627Z Collecting nvidia-nvtx-cu11==11.8.86 (from torch) 2025-05-07T19:50:08.6324292Z Downloading https://download.pytorch.org/whl/nightly/cu118/nvidia_nvtx_cu11-11.8.86-py3-none-manylinux1_x86_64.whl (99 kB) 2025-05-07T19:50:08.6324973Z Collecting pytorch-triton==3.3.0+git96316ce5 (from torch) 2025-05-07T19:50:08.6325836Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.6 kB) 2025-05-07T19:50:08.6328541Z Requirement already satisfied: setuptools>=40.8.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from pytorch-triton==3.3.0+git96316ce5->torch) (78.1.1) 2025-05-07T19:50:08.6329426Z Collecting mpmath<1.4,>=1.1.0 (from sympy>=1.13.3->torch) 2025-05-07T19:50:08.6329997Z Downloading https://download.pytorch.org/whl/nightly/mpmath-1.3.0-py3-none-any.whl (536 kB) 2025-05-07T19:50:08.6330654Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 536.2/536.2 kB 45.4 MB/s eta 0:00:00 2025-05-07T19:50:08.6332011Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from jinja2->torch) (3.0.2) 2025-05-07T19:50:08.6333228Z Downloading https://download.pytorch.org/whl/nightly/cu118/torch-2.8.0.dev20250507%2Bcu118-cp310-cp310-manylinux_2_28_x86_64.whl (916.4 MB) 2025-05-07T19:50:08.6334139Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 916.4/916.4 MB 29.8 MB/s eta 0:00:00 2025-05-07T19:50:08.6335027Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.3.0%2Bgit96316ce5-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (153.4 MB) 2025-05-07T19:50:08.6336143Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 153.4/153.4 MB 68.5 MB/s eta 0:00:00 2025-05-07T19:50:08.6337745Z Installing collected packages: mpmath, sympy, pytorch-triton, nvidia-nvtx-cu11, nvidia-nccl-cu11, nvidia-cusparse-cu11, nvidia-curand-cu11, nvidia-cufft-cu11, nvidia-cuda-runtime-cu11, nvidia-cuda-nvrtc-cu11, nvidia-cuda-cupti-cu11, nvidia-cublas-cu11, networkx, fsspec, filelock, nvidia-cusolver-cu11, nvidia-cudnn-cu11, torch 2025-05-07T19:50:08.6339211Z 2025-05-07T19:50:08.6340859Z Successfully installed filelock-3.16.1 fsspec-2024.10.0 mpmath-1.3.0 networkx-3.4.2 nvidia-cublas-cu11-11.11.3.6 nvidia-cuda-cupti-cu11-11.8.87 nvidia-cuda-nvrtc-cu11-11.8.89 nvidia-cuda-runtime-cu11-11.8.89 nvidia-cudnn-cu11-9.1.0.70 nvidia-cufft-cu11-10.9.0.58 nvidia-curand-cu11-10.3.0.86 nvidia-cusolver-cu11-11.4.1.48 nvidia-cusparse-cu11-11.7.5.86 nvidia-nccl-cu11-2.21.5 nvidia-nvtx-cu11-11.8.86 pytorch-triton-3.3.0+git96316ce5 sympy-1.13.3 torch-2.8.0.dev20250507+cu118 2025-05-07T19:50:08.6342582Z 2025-05-07T19:50:10.8510377Z torch 2.8.0.dev20250507+cu118 2025-05-07T19:50:10.8511758Z [CHECK] The installed package [torch, nightly/LATEST] is the correct variant (cu118) 2025-05-07T19:50:14.1613643Z [CHECK] Python (sub-)package 'torch.distributed' found ... 2025-05-07T19:50:17.5122477Z [CHECK] NOTE: The installed version is: 2.8.0.dev20250507+cu118 2025-05-07T19:50:17.5122984Z [CHECK] NOTE: Checking _GLIBCXX_USE_CXX11_ABI ... 2025-05-07T19:50:20.7811952Z True 2025-05-07T19:50:20.7812228Z True 2025-05-07T19:50:20.7812338Z 2025-05-07T19:50:20.8386286Z [INSTALL] Successfully installed PyTorch through PyTorch PIP 2025-05-07T19:50:20.8456559Z ##[group]Run if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:50:20.8457395Z if . $PRELUDE && which conda; then collect_pytorch_env_info $BUILD_ENV; fi 2025-05-07T19:50:20.8458118Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:50:20.8458458Z env: 2025-05-07T19:50:20.8458680Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:50:20.8459006Z BUILD_ENV: build_binary 2025-05-07T19:50:20.8459270Z BUILD_TARGET: default 2025-05-07T19:50:20.8459501Z BUILD_VARIANT: cuda 2025-05-07T19:50:20.8459752Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:50:20.8459998Z ##[endgroup] 2025-05-07T19:50:21.2821649Z /github/home/miniconda/bin/conda 2025-05-07T19:50:21.2823437Z ################################################################################ 2025-05-07T19:50:21.2823912Z # Collect PyTorch Environment Information (for Reporting Issues) 2025-05-07T19:50:21.2824327Z # 2025-05-07T19:50:21.2844683Z # [2025-05-07T19:50:21.283Z] + collect_pytorch_env_info build_binary 2025-05-07T19:50:21.2845183Z ################################################################################ 2025-05-07T19:50:21.2845441Z 2025-05-07T19:50:21.2864776Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:50:21.3763781Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:50:21.3771637Z [INFO] Downloading the PyTorch environment info collection script ... 2025-05-07T19:50:21.3772877Z + wget -q https://raw.githubusercontent.com/pytorch/pytorch/main/torch/utils/collect_env.py 2025-05-07T19:50:21.3773384Z 2025-05-07T19:50:21.4655293Z 2025-05-07T19:50:21.4656056Z [INFO] Collecting PyTorch environment info (will be needed for reporting issues to PyTorch) ... 2025-05-07T19:50:21.4677525Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary python collect_env.py 2025-05-07T19:50:26.8372688Z Collecting environment information... 2025-05-07T19:50:26.8373172Z PyTorch version: 2.8.0.dev20250507+cu118 2025-05-07T19:50:26.8373536Z Is debug build: False 2025-05-07T19:50:26.8373824Z CUDA used to build PyTorch: 11.8 2025-05-07T19:50:26.8374182Z ROCM used to build PyTorch: N/A 2025-05-07T19:50:26.8374370Z 2025-05-07T19:50:26.8374491Z OS: Amazon Linux 2023.7.20250428 (x86_64) 2025-05-07T19:50:26.8374808Z GCC version: Could not collect 2025-05-07T19:50:26.8375416Z Clang version: 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:50:26.8376030Z CMake version: version 4.0.2 2025-05-07T19:50:26.8376319Z Libc version: glibc-2.34 2025-05-07T19:50:26.8376482Z 2025-05-07T19:50:26.8376814Z Python version: 3.10.17 | packaged by conda-forge | (main, Apr 10 2025, 22:19:12) [GCC 13.3.0] (64-bit runtime) 2025-05-07T19:50:26.8377502Z Python platform: Linux-6.1.130-139.222.amzn2023.x86_64-x86_64-with-glibc2.34 2025-05-07T19:50:26.8377961Z Is CUDA available: False 2025-05-07T19:50:26.8378221Z CUDA runtime version: 11.8.89 2025-05-07T19:50:26.8378510Z CUDA_MODULE_LOADING set to: N/A 2025-05-07T19:50:26.8378833Z GPU models and configuration: Could not collect 2025-05-07T19:50:26.8379207Z Nvidia driver version: Could not collect 2025-05-07T19:50:26.8379522Z cuDNN version: Could not collect 2025-05-07T19:50:26.8379811Z HIP runtime version: N/A 2025-05-07T19:50:26.8380102Z MIOpen runtime version: N/A 2025-05-07T19:50:26.8380370Z Is XNNPACK available: True 2025-05-07T19:50:26.8380536Z 2025-05-07T19:50:26.8380635Z CPU: 2025-05-07T19:50:26.8380854Z Architecture: x86_64 2025-05-07T19:50:26.8381218Z CPU op-mode(s): 32-bit, 64-bit 2025-05-07T19:50:26.8381628Z Address sizes: 46 bits physical, 48 bits virtual 2025-05-07T19:50:26.8382055Z Byte Order: Little Endian 2025-05-07T19:50:26.8382394Z CPU(s): 96 2025-05-07T19:50:26.8382717Z On-line CPU(s) list: 0-95 2025-05-07T19:50:26.8383061Z Vendor ID: GenuineIntel 2025-05-07T19:50:26.8383824Z Model name: Intel(R) Xeon(R) Platinum 8275CL CPU @ 3.00GHz 2025-05-07T19:50:26.8384404Z CPU family: 6 2025-05-07T19:50:26.8384704Z Model: 85 2025-05-07T19:50:26.8385026Z Thread(s) per core: 2 2025-05-07T19:50:26.8385332Z Core(s) per socket: 24 2025-05-07T19:50:26.8385648Z Socket(s): 2 2025-05-07T19:50:26.8385940Z Stepping: 7 2025-05-07T19:50:26.8386269Z BogoMIPS: 5999.98 2025-05-07T19:50:26.8388797Z Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq monitor ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni 2025-05-07T19:50:26.8391082Z Hypervisor vendor: KVM 2025-05-07T19:50:26.8391390Z Virtualization type: full 2025-05-07T19:50:26.8391737Z L1d cache: 1.5 MiB (48 instances) 2025-05-07T19:50:26.8392098Z L1i cache: 1.5 MiB (48 instances) 2025-05-07T19:50:26.8392563Z L2 cache: 48 MiB (48 instances) 2025-05-07T19:50:26.8393144Z L3 cache: 71.5 MiB (2 instances) 2025-05-07T19:50:26.8393505Z NUMA node(s): 2 2025-05-07T19:50:26.8393923Z NUMA node0 CPU(s): 0-23,48-71 2025-05-07T19:50:26.8394292Z NUMA node1 CPU(s): 24-47,72-95 2025-05-07T19:50:26.8394829Z Vulnerability Gather data sampling: Unknown: Dependent on hypervisor status 2025-05-07T19:50:26.8395440Z Vulnerability Itlb multihit: KVM: Mitigation: VMX unsupported 2025-05-07T19:50:26.8395996Z Vulnerability L1tf: Mitigation; PTE Inversion 2025-05-07T19:50:26.8396668Z Vulnerability Mds: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:50:26.8397294Z Vulnerability Meltdown: Mitigation; PTI 2025-05-07T19:50:26.8397971Z Vulnerability Mmio stale data: Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown 2025-05-07T19:50:26.8398624Z Vulnerability Reg file data sampling: Not affected 2025-05-07T19:50:26.8399070Z Vulnerability Retbleed: Vulnerable 2025-05-07T19:50:26.8399475Z Vulnerability Spec rstack overflow: Not affected 2025-05-07T19:50:26.8399915Z Vulnerability Spec store bypass: Vulnerable 2025-05-07T19:50:26.8400547Z Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization 2025-05-07T19:50:26.8401434Z Vulnerability Spectre v2: Mitigation; Retpolines; STIBP disabled; RSB filling; PBRSB-eIBRS Not affected; BHI Retpoline 2025-05-07T19:50:26.8402125Z Vulnerability Srbds: Not affected 2025-05-07T19:50:26.8402502Z Vulnerability Tsx async abort: Not affected 2025-05-07T19:50:26.8402761Z 2025-05-07T19:50:26.8402867Z Versions of relevant libraries: 2025-05-07T19:50:26.8403142Z [pip3] numpy==2.2.5 2025-05-07T19:50:26.8403401Z [pip3] nvidia-cublas-cu11==11.11.3.6 2025-05-07T19:50:26.8403731Z [pip3] nvidia-cuda-cupti-cu11==11.8.87 2025-05-07T19:50:26.8404048Z [pip3] nvidia-cuda-nvrtc-cu11==11.8.89 2025-05-07T19:50:26.8404384Z [pip3] nvidia-cuda-runtime-cu11==11.8.89 2025-05-07T19:50:26.8404706Z [pip3] nvidia-cudnn-cu11==9.1.0.70 2025-05-07T19:50:26.8405014Z [pip3] nvidia-cufft-cu11==10.9.0.58 2025-05-07T19:50:26.8405404Z [pip3] nvidia-curand-cu11==10.3.0.86 2025-05-07T19:50:26.8405700Z [pip3] nvidia-cusolver-cu11==11.4.1.48 2025-05-07T19:50:26.8406104Z [pip3] nvidia-cusparse-cu11==11.7.5.86 2025-05-07T19:50:26.8406651Z [pip3] nvidia-nccl-cu11==2.21.5 2025-05-07T19:50:26.8406943Z [pip3] nvidia-nvtx-cu11==11.8.86 2025-05-07T19:50:26.8407236Z [pip3] pytorch-triton==3.3.0+git96316ce5 2025-05-07T19:50:26.8407560Z [pip3] torch==2.8.0.dev20250507+cu118 2025-05-07T19:50:26.8408132Z [conda] cuda-cudart 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8408699Z [conda] cuda-cudart-dev 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8409239Z [conda] cuda-cupti 11.8.87 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8409787Z [conda] cuda-libraries 11.8.0 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8410358Z [conda] cuda-libraries-dev 11.8.0 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8410900Z [conda] cuda-nvrtc 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8411435Z [conda] cuda-nvrtc-dev 11.8.89 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8411952Z [conda] cuda-nvtx 11.8.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8412478Z [conda] cuda-runtime 11.8.0 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8413009Z [conda] libcublas 11.11.3.6 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8413536Z [conda] libcublas-dev 11.11.3.6 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8414071Z [conda] libcufft 10.9.0.58 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8414590Z [conda] libcufft-dev 10.9.0.58 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8415126Z [conda] libcurand 10.3.0.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8415654Z [conda] libcurand-dev 10.3.0.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8416204Z [conda] libcusolver 11.4.1.48 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8416756Z [conda] libcusolver-dev 11.4.1.48 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8417301Z [conda] libcusparse 11.7.5.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8417857Z [conda] libcusparse-dev 11.7.5.86 0 nvidia/label/cuda-11.8.0 2025-05-07T19:50:26.8418473Z [conda] numpy 2.2.5 py310hefbff90_0 conda-forge 2025-05-07T19:50:26.8418965Z [conda] nvidia-cublas-cu11 11.11.3.6 pypi_0 pypi 2025-05-07T19:50:26.8419488Z [conda] nvidia-cuda-cupti-cu11 11.8.87 pypi_0 pypi 2025-05-07T19:50:26.8420009Z [conda] nvidia-cuda-nvrtc-cu11 11.8.89 pypi_0 pypi 2025-05-07T19:50:26.8420542Z [conda] nvidia-cuda-runtime-cu11 11.8.89 pypi_0 pypi 2025-05-07T19:50:26.8421050Z [conda] nvidia-cudnn-cu11 9.1.0.70 pypi_0 pypi 2025-05-07T19:50:26.8421548Z [conda] nvidia-cufft-cu11 10.9.0.58 pypi_0 pypi 2025-05-07T19:50:26.8422033Z [conda] nvidia-curand-cu11 10.3.0.86 pypi_0 pypi 2025-05-07T19:50:26.8422550Z [conda] nvidia-cusolver-cu11 11.4.1.48 pypi_0 pypi 2025-05-07T19:50:26.8423070Z [conda] nvidia-cusparse-cu11 11.7.5.86 pypi_0 pypi 2025-05-07T19:50:26.8423565Z [conda] nvidia-nccl-cu11 2.21.5 pypi_0 pypi 2025-05-07T19:50:26.8424058Z [conda] nvidia-nvtx-cu11 11.8.86 pypi_0 pypi 2025-05-07T19:50:26.8424558Z [conda] pytorch-triton 3.3.0+git96316ce5 pypi_0 pypi 2025-05-07T19:50:26.8425123Z [conda] torch 2.8.0.dev20250507+cu118 pypi_0 pypi 2025-05-07T19:50:26.8425412Z 2025-05-07T19:50:26.9385679Z ##[group]Run . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 11.8.0 2025-05-07T19:50:26.9386295Z . $PRELUDE; install_cudnn $BUILD_ENV "$(pwd)/build_only/cudnn" 11.8.0 2025-05-07T19:50:26.9386839Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:50:26.9387160Z env: 2025-05-07T19:50:26.9387370Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:50:26.9387678Z BUILD_ENV: build_binary 2025-05-07T19:50:26.9387911Z BUILD_TARGET: default 2025-05-07T19:50:26.9388151Z BUILD_VARIANT: cuda 2025-05-07T19:50:26.9388372Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:50:26.9388622Z ##[endgroup] 2025-05-07T19:50:27.4035252Z ################################################################################ 2025-05-07T19:50:27.4036298Z # Install cuDNN 2025-05-07T19:50:27.4036557Z # 2025-05-07T19:50:27.4052192Z # [2025-05-07T19:50:27.404Z] + install_cudnn build_binary /__w/FBGEMM/FBGEMM/build_only/cudnn 11.8.0 2025-05-07T19:50:27.4053815Z ################################################################################ 2025-05-07T19:50:27.4054090Z 2025-05-07T19:50:27.4065421Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:50:27.4989775Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:50:27.4991073Z [INSTALL] cuda_concat_version is determined to be: 118 2025-05-07T19:50:27.4991772Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:50:27.4992018Z 2025-05-07T19:50:27.5006100Z 2025-05-07T19:50:27.5006761Z + mkdir -p /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:50:27.5007518Z 2025-05-07T19:50:27.5022458Z 2025-05-07T19:50:27.5037892Z [INSTALL] Downloading cuDNN to /tmp/tmp.ttUgZWamwc ... 2025-05-07T19:50:27.5059760Z [EXEC] [ATTEMPT 0/3] + wget -q https://developer.download.nvidia.com/compute/redist/cudnn/v8.7.0/local_installers/11.8/cudnn-linux-x86_64-8.7.0.84_cuda11-archive.tar.xz -O cudnn.tar.xz 2025-05-07T19:50:35.7115629Z [INSTALL] Unpacking cuDNN ... 2025-05-07T19:50:35.7116657Z + tar -xvf cudnn.tar.xz 2025-05-07T19:50:35.7117155Z 2025-05-07T19:50:35.7145602Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/ 2025-05-07T19:50:35.7146707Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/ 2025-05-07T19:50:35.7147381Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer_static.a 2025-05-07T19:50:38.0968197Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer_static_v8.a 2025-05-07T19:50:38.0968819Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train_static.a 2025-05-07T19:50:40.3533320Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train_static_v8.a 2025-05-07T19:50:40.3535043Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer_static.a 2025-05-07T19:50:48.5535928Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer_static_v8.a 2025-05-07T19:50:48.5536580Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train_static.a 2025-05-07T19:50:50.1458513Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train_static_v8.a 2025-05-07T19:50:50.1459486Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer_static.a 2025-05-07T19:50:51.8355666Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer_static_v8.a 2025-05-07T19:50:51.8357408Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train_static.a 2025-05-07T19:50:53.3436783Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train_static_v8.a 2025-05-07T19:50:53.3437365Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn.so.8 2025-05-07T19:50:53.3437845Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn.so 2025-05-07T19:50:53.3438322Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn.so.8.7.0 2025-05-07T19:50:53.3452050Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer.so.8 2025-05-07T19:50:53.3452746Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer.so 2025-05-07T19:50:53.3453285Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_infer.so.8.7.0 2025-05-07T19:50:55.7224489Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train.so.8 2025-05-07T19:50:55.7226148Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train.so 2025-05-07T19:50:55.7227691Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_adv_train.so.8.7.0 2025-05-07T19:50:57.9791129Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer.so 2025-05-07T19:50:57.9793951Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer.so.8 2025-05-07T19:50:57.9795547Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_infer.so.8.7.0 2025-05-07T19:51:06.5122514Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train.so 2025-05-07T19:51:06.5123153Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train.so.8.7.0 2025-05-07T19:51:08.1302488Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_cnn_train.so.8 2025-05-07T19:51:08.1303200Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer.so.8.7.0 2025-05-07T19:51:09.8115010Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer.so 2025-05-07T19:51:09.8117453Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_infer.so.8 2025-05-07T19:51:09.8119040Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train.so.8.7.0 2025-05-07T19:51:11.3231459Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train.so 2025-05-07T19:51:11.3232409Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib/libcudnn_ops_train.so.8 2025-05-07T19:51:11.3232981Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/ 2025-05-07T19:51:11.3233737Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_v8.h 2025-05-07T19:51:11.3234497Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_infer_v8.h 2025-05-07T19:51:11.3235049Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_train_v8.h 2025-05-07T19:51:11.3235732Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_backend_v8.h 2025-05-07T19:51:11.3236772Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_infer_v8.h 2025-05-07T19:51:11.3237693Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_train_v8.h 2025-05-07T19:51:11.3238631Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_infer_v8.h 2025-05-07T19:51:11.3239772Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_train_v8.h 2025-05-07T19:51:11.3240743Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_version_v8.h 2025-05-07T19:51:11.3241644Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn.h 2025-05-07T19:51:11.3242597Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_infer.h 2025-05-07T19:51:11.3243196Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_adv_train.h 2025-05-07T19:51:11.3244088Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_backend.h 2025-05-07T19:51:11.3244577Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_infer.h 2025-05-07T19:51:11.3245084Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_cnn_train.h 2025-05-07T19:51:11.3245586Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_infer.h 2025-05-07T19:51:11.3246091Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_ops_train.h 2025-05-07T19:51:11.3246594Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include/cudnn_version.h 2025-05-07T19:51:11.3247030Z cudnn-linux-x86_64-8.7.0.84_cuda11-archive/LICENSE 2025-05-07T19:51:11.3256816Z 2025-05-07T19:51:11.3257360Z [INSTALL] Moving cuDNN files to /__w/FBGEMM/FBGEMM/build_only/cudnn ... 2025-05-07T19:51:11.3258361Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:51:11.3258626Z 2025-05-07T19:51:11.3273803Z 2025-05-07T19:51:11.3274039Z + rm -rf /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:51:11.3274423Z 2025-05-07T19:51:11.3293717Z 2025-05-07T19:51:11.3294778Z + mv cudnn-linux-x86_64-8.7.0.84_cuda11-archive/include /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:51:11.3295954Z 2025-05-07T19:51:11.3328053Z 2025-05-07T19:51:11.3330788Z + mv cudnn-linux-x86_64-8.7.0.84_cuda11-archive/lib /__w/FBGEMM/FBGEMM/build_only/cudnn 2025-05-07T19:51:11.3332333Z 2025-05-07T19:51:12.8425954Z 2025-05-07T19:51:12.8427014Z /__w/FBGEMM/FBGEMM 2025-05-07T19:51:12.8427850Z + rm -rf /tmp/tmp.ttUgZWamwc 2025-05-07T19:51:12.8428401Z 2025-05-07T19:51:12.8944931Z 2025-05-07T19:51:12.8949143Z [INSTALL] Set environment variables CUDNN_INCLUDE_DIR and CUDNN_LIBRARY ... 2025-05-07T19:51:12.8950619Z + conda env config vars set -n build_binary CUDNN_INCLUDE_DIR=/__w/FBGEMM/FBGEMM/build_only/cudnn/include CUDNN_LIBRARY=/__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:51:12.8951289Z 2025-05-07T19:51:13.3102768Z 2025-05-07T19:51:13.3103398Z [INSTALL] Successfully installed cuDNN (for CUDA 11.8.0) 2025-05-07T19:51:13.3196433Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:51:13.3197073Z . $PRELUDE; cd fbgemm_gpu; prepare_fbgemm_gpu_build $BUILD_ENV 2025-05-07T19:51:13.3197777Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:51:13.3198143Z env: 2025-05-07T19:51:13.3198445Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:51:13.3198790Z BUILD_ENV: build_binary 2025-05-07T19:51:13.3199102Z BUILD_TARGET: default 2025-05-07T19:51:13.3199359Z BUILD_VARIANT: cuda 2025-05-07T19:51:13.3199645Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:51:13.3199951Z ##[endgroup] 2025-05-07T19:51:13.7605213Z ################################################################################ 2025-05-07T19:51:13.7606321Z # Prepare FBGEMM-GPU Build 2025-05-07T19:51:13.7607075Z # 2025-05-07T19:51:13.7619840Z # [2025-05-07T19:51:13.761Z] + prepare_fbgemm_gpu_build build_binary 2025-05-07T19:51:13.7620578Z ################################################################################ 2025-05-07T19:51:13.7620825Z 2025-05-07T19:51:13.7641525Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:51:13.8523977Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:51:13.8540161Z [BUILD] Running git submodules update ... 2025-05-07T19:51:13.8565295Z [EXEC] [ATTEMPT 0/3] + git submodule sync 2025-05-07T19:51:13.8878985Z Synchronizing submodule url for '../external/asmjit' 2025-05-07T19:51:13.8879889Z Synchronizing submodule url for '../external/composable_kernel' 2025-05-07T19:51:13.8880393Z Synchronizing submodule url for '../external/cpuinfo' 2025-05-07T19:51:13.8880820Z Synchronizing submodule url for '../external/cutlass' 2025-05-07T19:51:13.8881377Z Synchronizing submodule url for '../external/googletest' 2025-05-07T19:51:13.8881818Z Synchronizing submodule url for '../external/hipify_torch' 2025-05-07T19:51:13.8882253Z Synchronizing submodule url for '../external/json' 2025-05-07T19:51:13.8913270Z [EXEC] [ATTEMPT 0/3] + git submodule update --init --recursive 2025-05-07T19:51:13.9383023Z [BUILD] Installing other build dependencies ... 2025-05-07T19:51:13.9401473Z [EXEC] [ATTEMPT 0/3] + conda run --no-capture-output -n build_binary python -m pip install -r requirements.txt 2025-05-07T19:51:16.0702717Z Collecting backports.tarfile (from -r requirements.txt (line 13)) 2025-05-07T19:51:16.0873754Z Downloading backports.tarfile-1.2.0-py3-none-any.whl.metadata (2.0 kB) 2025-05-07T19:51:16.0961158Z Requirement already satisfied: build in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 14)) (1.2.2.post1) 2025-05-07T19:51:16.2284493Z Collecting cmake (from -r requirements.txt (line 15)) 2025-05-07T19:51:16.2318789Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (6.3 kB) 2025-05-07T19:51:16.2396782Z Requirement already satisfied: click in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 16)) (8.1.8) 2025-05-07T19:51:16.2400783Z Requirement already satisfied: hypothesis in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 17)) (6.131.14) 2025-05-07T19:51:16.2403450Z Requirement already satisfied: jinja2 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 18)) (3.1.6) 2025-05-07T19:51:16.2405100Z Requirement already satisfied: mpmath==1.3.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 19)) (1.3.0) 2025-05-07T19:51:16.2716585Z Collecting ninja (from -r requirements.txt (line 20)) 2025-05-07T19:51:16.2753736Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl.metadata (5.0 kB) 2025-05-07T19:51:16.2825153Z Requirement already satisfied: numpy>=2.0.2 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 21)) (2.2.5) 2025-05-07T19:51:16.2976165Z Collecting pyre-extensions (from -r requirements.txt (line 22)) 2025-05-07T19:51:16.3009547Z Downloading pyre_extensions-0.0.32-py3-none-any.whl.metadata (4.0 kB) 2025-05-07T19:51:16.3084270Z Requirement already satisfied: pyyaml in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 23)) (6.0.2) 2025-05-07T19:51:16.3088313Z Requirement already satisfied: scikit-build in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 24)) (0.18.1) 2025-05-07T19:51:16.3092375Z Requirement already satisfied: setuptools in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from -r requirements.txt (line 25)) (78.1.1) 2025-05-07T19:51:16.3312112Z Collecting setuptools_git_versioning (from -r requirements.txt (line 26)) 2025-05-07T19:51:16.3341462Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl.metadata (6.1 kB) 2025-05-07T19:51:16.3537257Z Collecting tabulate (from -r requirements.txt (line 27)) 2025-05-07T19:51:16.3563850Z Downloading tabulate-0.9.0-py3-none-any.whl.metadata (34 kB) 2025-05-07T19:51:16.3818361Z Collecting patchelf (from -r requirements.txt (line 28)) 2025-05-07T19:51:16.3852960Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl.metadata (3.5 kB) 2025-05-07T19:51:16.4001309Z Requirement already satisfied: packaging>=19.1 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build->-r requirements.txt (line 14)) (25.0) 2025-05-07T19:51:16.4002962Z Requirement already satisfied: pyproject_hooks in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build->-r requirements.txt (line 14)) (1.2.0) 2025-05-07T19:51:16.4010888Z Requirement already satisfied: tomli>=1.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from build->-r requirements.txt (line 14)) (2.2.1) 2025-05-07T19:51:16.4127757Z Requirement already satisfied: attrs>=22.2.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from hypothesis->-r requirements.txt (line 17)) (25.3.0) 2025-05-07T19:51:16.4132920Z Requirement already satisfied: exceptiongroup>=1.0.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from hypothesis->-r requirements.txt (line 17)) (1.2.2) 2025-05-07T19:51:16.4139075Z Requirement already satisfied: sortedcontainers<3.0.0,>=2.1.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from hypothesis->-r requirements.txt (line 17)) (2.4.0) 2025-05-07T19:51:16.4158305Z Requirement already satisfied: MarkupSafe>=2.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from jinja2->-r requirements.txt (line 18)) (3.0.2) 2025-05-07T19:51:16.4291867Z Collecting typing-inspect (from pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:51:16.4325100Z Downloading typing_inspect-0.9.0-py3-none-any.whl.metadata (1.5 kB) 2025-05-07T19:51:16.4391556Z Requirement already satisfied: typing-extensions in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from pyre-extensions->-r requirements.txt (line 22)) (4.13.2) 2025-05-07T19:51:16.4436315Z Requirement already satisfied: distro in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from scikit-build->-r requirements.txt (line 24)) (1.9.0) 2025-05-07T19:51:16.4441938Z Requirement already satisfied: wheel>=0.32.0 in /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages (from scikit-build->-r requirements.txt (line 24)) (0.45.1) 2025-05-07T19:51:16.4756217Z Collecting mypy-extensions>=0.3.0 (from typing-inspect->pyre-extensions->-r requirements.txt (line 22)) 2025-05-07T19:51:16.4790384Z Downloading mypy_extensions-1.1.0-py3-none-any.whl.metadata (1.1 kB) 2025-05-07T19:51:16.4892312Z Downloading backports.tarfile-1.2.0-py3-none-any.whl (30 kB) 2025-05-07T19:51:16.4985493Z Downloading cmake-4.0.0-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (27.9 MB) 2025-05-07T19:51:16.6274698Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 27.9/27.9 MB 221.3 MB/s eta 0:00:00 2025-05-07T19:51:16.6314331Z Downloading ninja-1.11.1.4-py3-none-manylinux_2_12_x86_64.manylinux2010_x86_64.whl (422 kB) 2025-05-07T19:51:16.6399289Z Downloading pyre_extensions-0.0.32-py3-none-any.whl (12 kB) 2025-05-07T19:51:16.6475800Z Downloading setuptools_git_versioning-2.1.0-py3-none-any.whl (10 kB) 2025-05-07T19:51:16.6558417Z Downloading tabulate-0.9.0-py3-none-any.whl (35 kB) 2025-05-07T19:51:16.6619894Z Downloading patchelf-0.17.2.2-py3-none-manylinux1_x86_64.manylinux_2_5_x86_64.musllinux_1_1_x86_64.whl (466 kB) 2025-05-07T19:51:16.6704201Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-05-07T19:51:16.6767976Z Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-05-07T19:51:16.8580040Z Installing collected packages: tabulate, setuptools_git_versioning, patchelf, ninja, mypy-extensions, cmake, backports.tarfile, typing-inspect, pyre-extensions 2025-05-07T19:51:17.7908850Z 2025-05-07T19:51:17.7971344Z Successfully installed backports.tarfile-1.2.0 cmake-4.0.0 mypy-extensions-1.1.0 ninja-1.11.1.4 patchelf-0.17.2.2 pyre-extensions-0.0.32 setuptools_git_versioning-2.1.0 tabulate-0.9.0 typing-inspect-0.9.0 2025-05-07T19:51:17.7973658Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:51:17.9483732Z ################################################################################ 2025-05-07T19:51:17.9484165Z # Install PyTorch (PyTorch PIP) 2025-05-07T19:51:17.9484461Z # 2025-05-07T19:51:17.9505320Z # [2025-05-07T19:51:17.949Z] + install_triton_pip build_binary 2025-05-07T19:51:17.9505822Z ################################################################################ 2025-05-07T19:51:17.9506108Z 2025-05-07T19:51:17.9506347Z [BUILD] Installing pytorch-triton nightly/3.2.0+git4b3bb1f8 from PIP ... 2025-05-07T19:51:17.9506833Z ################################################################################ 2025-05-07T19:51:17.9507232Z # Install Package From PyTorch PIP: pytorch-triton 2025-05-07T19:51:17.9507593Z # 2025-05-07T19:51:17.9519670Z # [2025-05-07T19:51:17.951Z] + install_from_pytorch_pip build_binary pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:51:17.9520235Z ################################################################################ 2025-05-07T19:51:17.9520480Z 2025-05-07T19:51:17.9536040Z [EXEC] [ATTEMPT 0/3] + wget -q --timeout 1 pypi.org -O /dev/null 2025-05-07T19:51:18.0361124Z [CHECK] Network does not appear to be blocked. 2025-05-07T19:51:18.0361816Z ################################################################################ 2025-05-07T19:51:18.0362193Z # Prepare PIP Arguments (PyTorch PIP) 2025-05-07T19:51:18.0362485Z # 2025-05-07T19:51:18.0384447Z # [2025-05-07T19:51:18.037Z] + __prepare_pip_arguments pytorch-triton nightly/3.2.0+git4b3bb1f8 2025-05-07T19:51:18.0385027Z ################################################################################ 2025-05-07T19:51:18.0385278Z 2025-05-07T19:51:18.0433040Z [INSTALL] Extracted package (channel, version): (nightly, 3.2.0+git4b3bb1f8) 2025-05-07T19:51:18.0444574Z [INSTALL] Using a non-RELEASE channel: nightly ... 2025-05-07T19:51:18.0445188Z [INSTALL] Extracted the full PIP channel: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:51:18.0448113Z [INSTALL] Extracted the full PIP package: --pre pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:51:18.0455316Z [INSTALL] Attempting to install [pytorch-triton, 3.2.0+git4b3bb1f8] from PyTorch PIP using channel https://download.pytorch.org/whl/nightly/ ... 2025-05-07T19:51:18.0476316Z [EXEC] [ATTEMPT 0/3] + conda run -n build_binary pip install --pre pytorch-triton==3.2.0+git4b3bb1f8 --index-url https://download.pytorch.org/whl/nightly/ 2025-05-07T19:51:23.7516022Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. 2025-05-07T19:51:23.7517553Z torch 2.8.0.dev20250507+cu118 requires pytorch-triton==3.3.0+git96316ce5; platform_system == "Linux" and platform_machine == "x86_64", but you have pytorch-triton 3.2.0+git4b3bb1f8 which is incompatible. 2025-05-07T19:51:23.7519762Z WARNING: Running pip as the 'root' user can result in broken permissions and conflicting behaviour with the system package manager, possibly rendering your system unusable. It is recommended to use a virtual environment instead: https://pip.pypa.io/warnings/venv. Use the --root-user-action option if you know what you are doing and want to suppress this warning. 2025-05-07T19:51:23.7521249Z 2025-05-07T19:51:23.7521454Z Looking in indexes: https://download.pytorch.org/whl/nightly/ 2025-05-07T19:51:23.7521904Z Collecting pytorch-triton==3.2.0+git4b3bb1f8 2025-05-07T19:51:23.7522781Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.metadata (1.3 kB) 2025-05-07T19:51:23.7524148Z Downloading https://download.pytorch.org/whl/nightly/pytorch_triton-3.2.0%2Bgit4b3bb1f8-cp310-cp310-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (166.5 MB) 2025-05-07T19:51:23.7525333Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 166.5/166.5 MB 151.4 MB/s eta 0:00:00 2025-05-07T19:51:23.7525753Z Installing collected packages: pytorch-triton 2025-05-07T19:51:23.7526237Z Attempting uninstall: pytorch-triton 2025-05-07T19:51:23.7526630Z Found existing installation: pytorch-triton 3.3.0+git96316ce5 2025-05-07T19:51:23.7527078Z Uninstalling pytorch-triton-3.3.0+git96316ce5: 2025-05-07T19:51:23.7527503Z Successfully uninstalled pytorch-triton-3.3.0+git96316ce5 2025-05-07T19:51:23.7527976Z Successfully installed pytorch-triton-3.2.0+git4b3bb1f8 2025-05-07T19:51:23.7528244Z 2025-05-07T19:51:25.8834112Z [CHECK] Python (sub-)package 'triton' found ... 2025-05-07T19:51:25.8834586Z [CHECK] Printing out the pytorch-triton version ... 2025-05-07T19:51:27.9551077Z ################################################################################ 2025-05-07T19:51:27.9552394Z [CHECK] The installed VERSION of pytorch-triton is: 3.2.0 2025-05-07T19:51:27.9553852Z ################################################################################ 2025-05-07T19:51:27.9554097Z 2025-05-07T19:51:29.9391411Z [CHECK] Python (sub-)package 'numpy' found ... 2025-05-07T19:51:31.9733245Z [CHECK] Python (sub-)package 'skbuild' found ... 2025-05-07T19:51:31.9736767Z [BUILD] Successfully ran git submodules update 2025-05-07T19:51:31.9829450Z ##[group]Run . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:51:31.9830195Z . $PRELUDE; cd fbgemm_gpu; build_fbgemm_gpu_package $BUILD_ENV nightly default/cuda 2025-05-07T19:51:31.9830861Z shell: bash --noprofile --norc -e -o pipefail {0} 2025-05-07T19:51:31.9831196Z env: 2025-05-07T19:51:31.9832067Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T19:51:31.9832394Z BUILD_ENV: build_binary 2025-05-07T19:51:31.9832935Z BUILD_TARGET: default 2025-05-07T19:51:31.9833187Z BUILD_VARIANT: cuda 2025-05-07T19:51:31.9833460Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T19:51:31.9833931Z ##[endgroup] 2025-05-07T19:51:32.3997654Z [BUILD] BUILD_TARGET_VARIANT: default/cuda 2025-05-07T19:51:32.3998044Z [BUILD] Extracted build target: default 2025-05-07T19:51:32.3998398Z [BUILD] Extracted build variant: cuda 2025-05-07T19:51:34.2019197Z /github/home/miniconda/envs/build_binary/bin/cc 2025-05-07T19:51:34.2020004Z 2025-05-07T19:51:34.2600483Z [CHECK] Binary cc found in PATH 2025-05-07T19:51:36.1079788Z /github/home/miniconda/envs/build_binary/bin/gcc 2025-05-07T19:51:36.1080579Z 2025-05-07T19:51:36.1840109Z [CHECK] Binary gcc found in PATH 2025-05-07T19:51:38.0295516Z /github/home/miniconda/envs/build_binary/bin/c++ 2025-05-07T19:51:38.0296377Z 2025-05-07T19:51:38.1032787Z [CHECK] Binary c++ found in PATH 2025-05-07T19:51:39.9459063Z /github/home/miniconda/envs/build_binary/bin/g++ 2025-05-07T19:51:39.9459898Z 2025-05-07T19:51:40.0206892Z [CHECK] Binary g++ found in PATH 2025-05-07T19:51:41.9188432Z [BUILD] Extracted and set Python tag: py310 2025-05-07T19:51:41.9189877Z [BUILD] Extracted and set Python platform name: manylinux_2_28_x86_64 2025-05-07T19:51:41.9436769Z core = 24 2025-05-07T19:51:41.9641667Z sockets = 2 2025-05-07T19:51:41.9642688Z [BUILD] Set multicore run option for setup.py: -j 48 2025-05-07T19:51:41.9643754Z [CHECK] LD_LIBRARY_PATH = 2025-05-07T19:51:41.9644550Z [BUILD] Running pre-build cleanups ... 2025-05-07T19:51:41.9645440Z + rm -rf dist 2025-05-07T19:51:41.9645800Z 2025-05-07T19:51:41.9659161Z 2025-05-07T19:51:41.9660172Z + conda run --no-capture-output -n build_binary python setup.py clean 2025-05-07T19:51:41.9660520Z 2025-05-07T19:51:45.1281634Z INFO:root:running clean 2025-05-07T19:51:45.1282040Z [SETUP.PY] ARGV: ['setup.py', 'clean'] 2025-05-07T19:51:45.1283154Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:51:45.1284274Z [SETUP.PY] Other arguments: ['clean'] 2025-05-07T19:51:45.1284769Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:51:45.1285356Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:51:45.1285944Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:51:45.1286537Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:51:45.1286950Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:51:45.1288234Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:51:45.4405645Z 2025-05-07T19:51:45.4406974Z [BUILD] Printing git status ... 2025-05-07T19:51:45.4407871Z + git status 2025-05-07T19:51:45.4408230Z 2025-05-07T19:51:45.9987180Z HEAD detached at pull/4066/merge 2025-05-07T19:51:45.9987520Z Untracked files: 2025-05-07T19:51:45.9987921Z (use "git add ..." to include in what will be committed) 2025-05-07T19:51:45.9988304Z ../build_only/ 2025-05-07T19:51:45.9988554Z ../collect_env.py 2025-05-07T19:51:45.9988816Z fbgemm_gpu/docs/version.py 2025-05-07T19:51:45.9989016Z 2025-05-07T19:51:45.9990671Z nothing added to commit but untracked files present (use "git add" to track) 2025-05-07T19:51:45.9991049Z 2025-05-07T19:51:45.9991160Z + git diff 2025-05-07T19:51:45.9991284Z 2025-05-07T19:51:46.0262013Z 2025-05-07T19:51:46.0262401Z ################################################################################ 2025-05-07T19:51:46.0262829Z # Configure FBGEMM-GPU Build 2025-05-07T19:51:46.0263115Z # 2025-05-07T19:51:46.0277956Z # [2025-05-07T19:51:46.027Z] + __configure_fbgemm_gpu_build 2025-05-07T19:51:46.0279111Z ################################################################################ 2025-05-07T19:51:46.0279973Z 2025-05-07T19:51:46.0284405Z [BUILD] Setting the build target: default ... 2025-05-07T19:51:46.0285792Z [BUILD] Configuring build as CUDA variant (this is the default behavior) ... 2025-05-07T19:51:47.8612254Z /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:51:47.8612567Z 2025-05-07T19:51:47.9436803Z [CHECK] Binary nvcc found in PATH 2025-05-07T19:51:49.7560214Z /__w/FBGEMM/FBGEMM/build_only/cudnn/include 2025-05-07T19:51:49.7560575Z 2025-05-07T19:51:49.8148696Z [CHECK] Environment variable CUDNN_INCLUDE_DIR is defined in the Conda environment 2025-05-07T19:51:51.6457426Z /__w/FBGEMM/FBGEMM/build_only/cudnn/lib 2025-05-07T19:51:51.6458145Z 2025-05-07T19:51:51.7042457Z [CHECK] Environment variable CUDNN_LIBRARY is defined in the Conda environment 2025-05-07T19:51:53.5134181Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:51:53.5134588Z 2025-05-07T19:51:53.5716782Z [CHECK] Environment variable NVML_LIB_PATH is defined in the Conda environment 2025-05-07T19:51:55.4361948Z [BUILD] Using the default architectures for CUDA nvcc: NVIDIA (R) Cuda compiler driver 2025-05-07T19:51:55.4363706Z Copyright (c) 2005-2022 NVIDIA Corporation 2025-05-07T19:51:55.4364079Z Built on Wed_Sep_21_10:33:58_PDT_2022 2025-05-07T19:51:55.4364429Z Cuda compilation tools, release 11.8, V11.8.89 2025-05-07T19:51:55.4364846Z Build cuda_11.8.r11.8/compiler.31833905_0 ... 2025-05-07T19:51:55.4365222Z [BUILD] Setting the following CUDA targets: 7.0;8.0 2025-05-07T19:51:55.4365677Z [BUILD] Looking up NVML filepath ... 2025-05-07T19:51:57.2957479Z [BUILD] Looking up NCCL filepath ... 2025-05-07T19:52:01.0895912Z [BUILD] Setting NVCC verbose mode ... 2025-05-07T19:52:01.0896579Z + conda env config vars set -n build_binary NVCC_VERBOSE=1 2025-05-07T19:52:01.0896888Z 2025-05-07T19:52:01.5147066Z 2025-05-07T19:52:01.5147957Z [BUILD] Setting CUDA build args ... 2025-05-07T19:52:01.5155892Z [BUILD] Looking up CUDA version ... 2025-05-07T19:52:05.2555101Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:52:05.2555555Z 2025-05-07T19:52:07.1675139Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:52:07.1676074Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:52:07.1676863Z 2025-05-07T19:52:07.1677131Z [BUILD] Setting NVCC flags ... 2025-05-07T19:52:07.1678192Z + conda env config vars set -n build_binary NVCC_PREPEND_FLAGS="-std=c++17 -Xcompiler -std=c++17 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler" 2025-05-07T19:52:07.1679100Z 2025-05-07T19:52:07.5880370Z 2025-05-07T19:52:07.5881694Z + conda run -n build_binary printenv NVCC_PREPEND_FLAGS 2025-05-07T19:52:07.5883171Z 2025-05-07T19:52:09.4035679Z -std=c++17 -Xcompiler -std=c++17 -Xcompiler -stdlib=libstdc++ -ccbin /github/home/miniconda/envs/build_binary/bin/c++ -allow-unsupported-compiler 2025-05-07T19:52:09.4036465Z 2025-05-07T19:52:09.4650200Z 2025-05-07T19:52:09.4651142Z [BUILD] Setting CUDA build args ... 2025-05-07T19:52:09.4652883Z + conda run -n build_binary c++ --version 2025-05-07T19:52:09.4653546Z 2025-05-07T19:52:11.2963075Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:52:11.2965544Z Target: x86_64-conda-linux-gnu 2025-05-07T19:52:11.2966195Z Thread model: posix 2025-05-07T19:52:11.2966528Z InstalledDir: /github/home/miniconda/envs/build_binary/bin 2025-05-07T19:52:11.2967177Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:52:11.2967637Z 2025-05-07T19:52:11.3525547Z 2025-05-07T19:52:11.3526026Z + conda run -n build_binary c++ --version | grep -i clang 2025-05-07T19:52:11.3526478Z 2025-05-07T19:52:13.2709319Z clang version 16.0.6 (https://github.com/conda-forge/clangdev-feedstock db6970f6bb85e49860ed8bab43ebf165b5c55cc4) 2025-05-07T19:52:13.2710820Z Configuration file: /github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-clang++.cfg 2025-05-07T19:52:13.2711346Z 2025-05-07T19:52:13.2711753Z [BUILD] Clang is available; configuring for Clang-based build ... 2025-05-07T19:52:15.1642928Z .github/scripts/fbgemm_gpu_build.bash: line 370: [: : integer expression expected 2025-05-07T19:52:15.1643554Z [BUILD] Enabling debug features in the build ... 2025-05-07T19:52:15.1645498Z [BUILD] FBGEMM_GPU build arguments have been set: --verbose --build-target=default --build-variant=cuda --nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 -DTORCH_CUDA_ARCH_LIST='7.0;8.0' -DCMAKE_CXX_STANDARD=17 --cxxprefix=/github/home/miniconda/envs/build_binary --debug 2025-05-07T19:52:15.1647387Z ################################################################################ 2025-05-07T19:52:15.1647764Z # Build FBGEMM-GPU Package (Wheel) 2025-05-07T19:52:15.1648079Z # 2025-05-07T19:52:15.1671056Z # [2025-05-07T19:52:15.166Z] + build_fbgemm_gpu_package build_binary nightly default/cuda 2025-05-07T19:52:15.1672796Z ################################################################################ 2025-05-07T19:52:15.1673495Z 2025-05-07T19:52:15.1674094Z [BUILD] Building FBGEMM wheel (TARGET=default, VARIANT=cuda) ... 2025-05-07T19:52:15.1679075Z + conda run --no-capture-output -n build_binary python -m build --wheel --no-isolation --config-setting=--build-option=--verbose --config-setting=--build-option=--build-target=default --config-setting=--build-option=--build-variant=cuda --config-setting=--build-option=--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so --config-setting=--build-option=--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 --config-setting=--build-option=-DTORCH_CUDA_ARCH_LIST='7.0;8.0' --config-setting=--build-option=-DCMAKE_CXX_STANDARD=17 --config-setting=--build-option=--cxxprefix=/github/home/miniconda/envs/build_binary --config-setting=--build-option=--debug --config-setting=--build-option=--package_channel=nightly --config-setting=--build-option=--python-tag=py310 --config-setting=--build-option=--plat-name=manylinux_2_28_x86_64 2025-05-07T19:52:15.1682621Z 2025-05-07T19:52:17.0361073Z * Getting build dependencies for wheel... 2025-05-07T19:52:18.3277179Z INFO:root:running egg_info 2025-05-07T19:52:18.3304034Z INFO:root:creating fbgemm_gpu_nightly.egg-info 2025-05-07T19:52:18.3304748Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T19:52:18.3309441Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T19:52:18.3310573Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T19:52:18.3311831Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T19:52:18.3313445Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:52:18.3374903Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:52:18.3390963Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T19:52:18.3392398Z [SETUP.PY] ARGV: ['setup.py', 'egg_info'] 2025-05-07T19:52:18.3394425Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=False, debug=False, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path=None, nccl_lib_path=None, use_fb_only=False, cxxprefix=None) 2025-05-07T19:52:18.3396238Z [SETUP.PY] Other arguments: ['egg_info'] 2025-05-07T19:52:18.3396964Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:52:18.3397802Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:52:18.3398677Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:52:18.3399678Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:52:18.3400270Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:52:18.3402122Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', "-DCMAKE_C_FLAGS=''", "-DCMAKE_CXX_FLAGS=''"] 2025-05-07T19:52:18.6116193Z * Building wheel... 2025-05-07T19:52:19.9060093Z [SETUP.PY] ARGV: ['setup.py', 'bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-2e_4cqdf', '--verbose', '--build-target=default', '--build-variant=cuda', '--nvml_lib_path=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '--nccl_lib_path=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0', '-DCMAKE_CXX_STANDARD=17', '--cxxprefix=/github/home/miniconda/envs/build_binary', '--debug', '--package_channel=nightly', '--python-tag=py310', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:52:19.9064220Z [SETUP.PY] Parsed setup.py arguments: Namespace(verbose=True, debug=True, dryrun=False, build_target='default', build_variant='cuda', package_channel='nightly', nvml_lib_path='/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', nccl_lib_path='/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2', use_fb_only=False, cxxprefix='/github/home/miniconda/envs/build_binary') 2025-05-07T19:52:19.9066760Z [SETUP.PY] Other arguments: ['bdist_wheel', '--dist-dir', '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-2e_4cqdf', '-DTORCH_CUDA_ARCH_LIST=7.0;8.0', '-DCMAKE_CXX_STANDARD=17', '--python-tag=py310', '--plat-name=manylinux_2_28_x86_64'] 2025-05-07T19:52:19.9067882Z [SETUP.PY] CUDA CUB directory environment variable not set. Using default CUB location. 2025-05-07T19:52:19.9068576Z [SETUP.PY] Using CUDA = /github/home/miniconda/envs/build_binary 2025-05-07T19:52:19.9069152Z [SETUP.PY] Generating version file at: /__w/FBGEMM/FBGEMM/fbgemm_gpu/fbgemm_gpu/docs/version.py 2025-05-07T19:52:19.9069716Z [SETUP.PY] Setting the FBGEMM build target: default ... 2025-05-07T19:52:19.9070218Z [SETUP.PY] Setting the FBGEMM build variant: cuda ... 2025-05-07T19:52:19.9076172Z [SETUP.PY] Passing CMake arguments: ['-DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch', '-D_GLIBCXX_USE_CXX11_ABI=1', '-DCMAKE_VERBOSE_MAKEFILE=ON', '-DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE', '-DFBGEMM_BUILD_TARGET=default', '-DFBGEMM_BUILD_VARIANT=cuda', '-DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so', '-DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include', '-DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2', '-DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc', '-DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++', "-DCMAKE_C_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", "-DCMAKE_CXX_FLAGS='-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'", '-DTORCH_CUDA_ARCH_LIST=7.0;8.0', '-DCMAKE_CXX_STANDARD=17'] 2025-05-07T19:52:19.9081831Z 2025-05-07T19:52:19.9081836Z 2025-05-07T19:52:19.9082006Z -------------------------------------------------------------------------------- 2025-05-07T19:52:19.9082427Z -- Trying 'Ninja' generator 2025-05-07T19:52:19.9082695Z -------------------------------- 2025-05-07T19:52:19.9082971Z --------------------------- 2025-05-07T19:52:19.9083212Z ---------------------- 2025-05-07T19:52:19.9087089Z ----------------- 2025-05-07T19:52:19.9087287Z ------------ 2025-05-07T19:52:19.9087490Z ------- 2025-05-07T19:52:19.9087666Z -- 2025-05-07T19:52:19.9546993Z CMake Deprecation Warning at CMakeLists.txt:1 (cmake_minimum_required): 2025-05-07T19:52:19.9548721Z Compatibility with CMake < 3.10 will be removed from a future version of 2025-05-07T19:52:19.9549926Z CMake. 2025-05-07T19:52:19.9550286Z 2025-05-07T19:52:19.9550958Z Update the VERSION argument value. Or, use the ... syntax 2025-05-07T19:52:19.9552356Z to tell CMake that the project requires at least but has been updated 2025-05-07T19:52:19.9552939Z to work with policies introduced by or earlier. 2025-05-07T19:52:19.9553367Z 2025-05-07T19:52:19.9553373Z 2025-05-07T19:52:19.9553560Z Not searching for unused variables given on the command line. 2025-05-07T19:52:20.0424847Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:52:20.0514709Z -- Detecting C compiler ABI info 2025-05-07T19:52:20.1821292Z -- Detecting C compiler ABI info - done 2025-05-07T19:52:20.1952443Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:52:20.1954245Z -- Detecting C compile features 2025-05-07T19:52:20.1955237Z -- Detecting C compile features - done 2025-05-07T19:52:20.3449092Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:52:20.3517137Z -- Detecting CXX compiler ABI info 2025-05-07T19:52:20.4907190Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:52:20.5038542Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:52:20.5040236Z -- Detecting CXX compile features 2025-05-07T19:52:20.5049864Z -- Detecting CXX compile features - done 2025-05-07T19:52:20.5065417Z -- Configuring done (0.6s) 2025-05-07T19:52:20.5115402Z -- Generating done (0.0s) 2025-05-07T19:52:20.5125316Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_cmake_test_compile/build 2025-05-07T19:52:20.5171586Z -- 2025-05-07T19:52:20.5171974Z ------- 2025-05-07T19:52:20.5172306Z ------------ 2025-05-07T19:52:20.5172618Z ----------------- 2025-05-07T19:52:20.5172957Z ---------------------- 2025-05-07T19:52:20.5173327Z --------------------------- 2025-05-07T19:52:20.5173727Z -------------------------------- 2025-05-07T19:52:20.5174195Z -- Trying 'Ninja' generator - success 2025-05-07T19:52:20.5174846Z -------------------------------------------------------------------------------- 2025-05-07T19:52:20.5175293Z 2025-05-07T19:52:20.5190379Z Configuring Project 2025-05-07T19:52:20.5190775Z Working directory: 2025-05-07T19:52:20.5191229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build 2025-05-07T19:52:20.5191644Z Command: 2025-05-07T19:52:20.5211363Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/cmake/data/bin/cmake /__w/FBGEMM/FBGEMM/fbgemm_gpu -G Ninja -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja --no-warn-unused-cli -DCMAKE_INSTALL_PREFIX:PATH=/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install -DPYTHON_VERSION_STRING:STRING=3.10.17 -DSKBUILD:INTERNAL=TRUE -DCMAKE_MODULE_PATH:PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/skbuild/resources/cmake -DPYTHON_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPYTHON_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.10 -DPYTHON_LIBRARY:PATH=/github/home/miniconda/envs/build_binary/lib/libpython3.10.so -DPython_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython_FIND_REGISTRY:STRING=NEVER -DPython_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.10 -DPython_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/numpy/_core/include -DPython3_EXECUTABLE:PATH=/github/home/miniconda/envs/build_binary/bin/python -DPython3_ROOT_DIR:PATH=/github/home/miniconda/envs/build_binary -DPython3_FIND_REGISTRY:STRING=NEVER -DPython3_INCLUDE_DIR:PATH=/github/home/miniconda/envs/build_binary/include/python3.10 -DPython3_NumPy_INCLUDE_DIRS:PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/numpy/_core/include -DCMAKE_MAKE_PROGRAM:FILEPATH=/github/home/miniconda/envs/build_binary/bin/ninja -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release -DCMAKE_PREFIX_PATH=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch -D_GLIBCXX_USE_CXX11_ABI=1 -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_EXPORT_COMPILE_COMMANDS=TRUE -DFBGEMM_BUILD_TARGET=default -DFBGEMM_BUILD_VARIANT=cuda -DNVML_LIB_PATH=/github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -DNCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -DNCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 -DCMAKE_C_COMPILER=/github/home/miniconda/envs/build_binary/bin/cc -DCMAKE_CXX_COMPILER=/github/home/miniconda/envs/build_binary/bin/c++ '-DCMAKE_C_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DCMAKE_CXX_FLAGS='"'"'-DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include'"'"'' '-DTORCH_CUDA_ARCH_LIST=7.0;8.0' -DCMAKE_CXX_STANDARD=17 '-DTORCH_CUDA_ARCH_LIST=7.0;8.0' -DCMAKE_CXX_STANDARD=17 -DCMAKE_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ar -DCMAKE_CXX_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_C_COMPILER_AR=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ar -DCMAKE_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ranlib -DCMAKE_CXX_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_C_COMPILER_RANLIB=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-gcc-ranlib -DCMAKE_LINKER=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-ld -DCMAKE_STRIP=/github/home/miniconda/envs/build_binary/bin/x86_64-conda-linux-gnu-strip -DCMAKE_BUILD_TYPE=Release 2025-05-07T19:52:20.5230552Z 2025-05-07T19:52:20.5669908Z 2025-05-07T19:52:20.5669923Z 2025-05-07T19:52:20.5670452Z ================================================================================ 2025-05-07T19:52:20.5671322Z Not searching for unused variables given on the command line. 2025-05-07T19:52:20.5671863Z Default C compiler flags 2025-05-07T19:52:20.5672242Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:52:20.5672548Z 2025-05-07T19:52:20.5673569Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:52:20.5674649Z ================================================================================ 2025-05-07T19:52:20.5674906Z 2025-05-07T19:52:20.5675064Z 2025-05-07T19:52:20.5675068Z 2025-05-07T19:52:20.5675184Z ================================================================================ 2025-05-07T19:52:20.5675519Z Default C++ compiler flags 2025-05-07T19:52:20.5675898Z (values may be overridden by CMAKE_CXX_STANDARD and CXX_STANDARD): 2025-05-07T19:52:20.5676206Z 2025-05-07T19:52:20.5677081Z -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include 2025-05-07T19:52:20.5678140Z ================================================================================ 2025-05-07T19:52:20.5678394Z 2025-05-07T19:52:20.5678398Z 2025-05-07T19:52:20.5678402Z 2025-05-07T19:52:20.5678518Z ================================================================================ 2025-05-07T19:52:20.5678855Z AVX2_FLAGS: 2025-05-07T19:52:20.5678978Z 2025-05-07T19:52:20.5679061Z -mavx2 2025-05-07T19:52:20.5679268Z -mf16c 2025-05-07T19:52:20.5679460Z -mfma 2025-05-07T19:52:20.5679662Z -fopenmp 2025-05-07T19:52:20.5679888Z ================================================================================ 2025-05-07T19:52:20.5680136Z 2025-05-07T19:52:20.5680139Z 2025-05-07T19:52:20.5680143Z 2025-05-07T19:52:20.5680258Z ================================================================================ 2025-05-07T19:52:20.5680590Z AVX512_FLAGS: 2025-05-07T19:52:20.5680717Z 2025-05-07T19:52:20.5680796Z -mavx2 2025-05-07T19:52:20.5680994Z -mf16c 2025-05-07T19:52:20.5681179Z -mfma 2025-05-07T19:52:20.5681381Z -mavx512f 2025-05-07T19:52:20.5681580Z -mavx512bw 2025-05-07T19:52:20.5681792Z -mavx512dq 2025-05-07T19:52:20.5681983Z -mavx512vl 2025-05-07T19:52:20.5682186Z -fopenmp 2025-05-07T19:52:20.5682410Z ================================================================================ 2025-05-07T19:52:20.5682656Z 2025-05-07T19:52:20.5682660Z 2025-05-07T19:52:20.5682663Z 2025-05-07T19:52:20.5682778Z ================================================================================ 2025-05-07T19:52:20.5683138Z The project is built using scikit-build 2025-05-07T19:52:20.5683464Z ================================================================================ 2025-05-07T19:52:20.5683707Z 2025-05-07T19:52:20.5683711Z 2025-05-07T19:52:20.5683714Z 2025-05-07T19:52:20.5683835Z ================================================================================ 2025-05-07T19:52:20.5684147Z Build Settings 2025-05-07T19:52:20.5684293Z 2025-05-07T19:52:20.5684397Z FBGEMM_BUILD_TARGET : default 2025-05-07T19:52:20.5684699Z FBGEMM_BUILD_VARIANT : cuda 2025-05-07T19:52:20.5684999Z 2025-05-07T19:52:20.5685092Z NVCC_VERBOSE : 2025-05-07T19:52:20.5685359Z CUDNN_INCLUDE_DIR : 2025-05-07T19:52:20.5685603Z CUDNN_LIBRARY : 2025-05-07T19:52:20.5686033Z NVML_LIB_PATH : /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:20.5686506Z TORCH_CUDA_ARCH_LIST : 7.0 2025-05-07T19:52:20.5686769Z 8.0 2025-05-07T19:52:20.5686878Z 2025-05-07T19:52:20.5686988Z HIP_ROOT_DIR : 2025-05-07T19:52:20.5687233Z HIPCC_VERBOSE : 2025-05-07T19:52:20.5687498Z AMDGPU_TARGETS : 2025-05-07T19:52:20.5687745Z PYTORCH_ROCM_ARCH : 2025-05-07T19:52:20.5688026Z ================================================================================ 2025-05-07T19:52:20.5688256Z 2025-05-07T19:52:20.7199056Z -- The CXX compiler identification is Clang 16.0.6 2025-05-07T19:52:20.7902565Z -- The C compiler identification is Clang 16.0.6 2025-05-07T19:52:21.7299826Z -- The CUDA compiler identification is NVIDIA 11.8.89 with host compiler Clang 16.0.6 2025-05-07T19:52:21.7409344Z -- Detecting CXX compiler ABI info 2025-05-07T19:52:21.8660103Z -- Detecting CXX compiler ABI info - done 2025-05-07T19:52:21.8794620Z -- Check for working CXX compiler: /github/home/miniconda/envs/build_binary/bin/c++ - skipped 2025-05-07T19:52:21.8795170Z -- Detecting CXX compile features 2025-05-07T19:52:21.8802349Z -- Detecting CXX compile features - done 2025-05-07T19:52:21.8876304Z -- Detecting C compiler ABI info 2025-05-07T19:52:22.0049930Z -- Detecting C compiler ABI info - done 2025-05-07T19:52:22.0182883Z -- Check for working C compiler: /github/home/miniconda/envs/build_binary/bin/cc - skipped 2025-05-07T19:52:22.0183479Z -- Detecting C compile features 2025-05-07T19:52:22.0187650Z -- Detecting C compile features - done 2025-05-07T19:52:22.0234249Z -- Detecting CUDA compiler ABI info 2025-05-07T19:52:22.9209334Z -- Detecting CUDA compiler ABI info - done 2025-05-07T19:52:22.9674966Z -- Check for working CUDA compiler: /github/home/miniconda/envs/build_binary/bin/nvcc - skipped 2025-05-07T19:52:22.9707055Z -- Detecting CUDA compile features 2025-05-07T19:52:22.9708374Z -- Detecting CUDA compile features - done 2025-05-07T19:52:22.9734060Z -- Performing Test C_HAS_AVX_1 2025-05-07T19:52:23.2658430Z -- Performing Test C_HAS_AVX_1 - Failed 2025-05-07T19:52:23.2658976Z -- Performing Test C_HAS_AVX_2 2025-05-07T19:52:23.5982465Z -- Performing Test C_HAS_AVX_2 - Success 2025-05-07T19:52:23.5983491Z -- Performing Test C_HAS_AVX2_1 2025-05-07T19:52:23.8853458Z -- Performing Test C_HAS_AVX2_1 - Failed 2025-05-07T19:52:23.8853854Z -- Performing Test C_HAS_AVX2_2 2025-05-07T19:52:24.2178426Z -- Performing Test C_HAS_AVX2_2 - Success 2025-05-07T19:52:24.2179474Z -- Performing Test C_HAS_AVX512_1 2025-05-07T19:52:24.5077680Z -- Performing Test C_HAS_AVX512_1 - Failed 2025-05-07T19:52:24.5078099Z -- Performing Test C_HAS_AVX512_2 2025-05-07T19:52:24.8415727Z -- Performing Test C_HAS_AVX512_2 - Success 2025-05-07T19:52:24.8416724Z -- Performing Test CXX_HAS_AVX_1 2025-05-07T19:52:25.1293332Z -- Performing Test CXX_HAS_AVX_1 - Failed 2025-05-07T19:52:25.1294392Z -- Performing Test CXX_HAS_AVX_2 2025-05-07T19:52:25.4634498Z -- Performing Test CXX_HAS_AVX_2 - Success 2025-05-07T19:52:25.4635572Z -- Performing Test CXX_HAS_AVX2_1 2025-05-07T19:52:25.7516010Z -- Performing Test CXX_HAS_AVX2_1 - Failed 2025-05-07T19:52:25.7517071Z -- Performing Test CXX_HAS_AVX2_2 2025-05-07T19:52:26.0842949Z -- Performing Test CXX_HAS_AVX2_2 - Success 2025-05-07T19:52:26.0844768Z -- Performing Test CXX_HAS_AVX512_1 2025-05-07T19:52:26.3720635Z -- Performing Test CXX_HAS_AVX512_1 - Failed 2025-05-07T19:52:26.3723150Z -- Performing Test CXX_HAS_AVX512_2 2025-05-07T19:52:26.7069240Z -- Performing Test CXX_HAS_AVX512_2 - Success 2025-05-07T19:52:26.7261636Z -- Found CUDA: /github/home/miniconda/envs/build_binary (found version "11.8") 2025-05-07T19:52:26.7297437Z -- Found CUDAToolkit: /github/home/miniconda/envs/build_binary/include (found version "11.8.89") 2025-05-07T19:52:26.7378790Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD 2025-05-07T19:52:26.8687575Z -- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success 2025-05-07T19:52:26.8699739Z -- Found Threads: TRUE 2025-05-07T19:52:26.9484975Z -- PyTorch: CUDA detected: 11.8 2025-05-07T19:52:26.9486275Z -- PyTorch: CUDA nvcc is: /github/home/miniconda/envs/build_binary/bin/nvcc 2025-05-07T19:52:26.9487305Z -- PyTorch: CUDA toolkit directory: /github/home/miniconda/envs/build_binary 2025-05-07T19:52:27.1063783Z -- PyTorch: Header version is: 11.8 2025-05-07T19:52:27.1893999Z -- Found Python: /github/home/miniconda/envs/build_binary/bin/python (found version "3.10.17") found components: Interpreter 2025-05-07T19:52:27.1907108Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Caffe2/public/cuda.cmake:140 (message): 2025-05-07T19:52:27.1908030Z -- USE_CUDNN is set to 0. Compiling without cuDNN support 2025-05-07T19:52:27.1908529Z -- USE_CUSPARSELT is set to 0. Compiling without cuSPARSELt support 2025-05-07T19:52:27.1909001Z -- USE_CUDSS is set to 0. Compiling without cuDSS support 2025-05-07T19:52:27.1909451Z -- USE_CUFILE is set to 0. Compiling without cuFile support 2025-05-07T19:52:27.1909879Z Failed to compute shorthash for libnvrtc.so 2025-05-07T19:52:27.1910242Z Call Stack (most recent call first): 2025-05-07T19:52:27.1910966Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Caffe2/Caffe2Config.cmake:86 (include) 2025-05-07T19:52:27.1912249Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:68 (find_package) 2025-05-07T19:52:27.1913288Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:52:27.1913796Z CMakeLists.txt:112 (include) 2025-05-07T19:52:27.1914005Z 2025-05-07T19:52:27.1914010Z 2025-05-07T19:52:27.1914334Z -- Added CUDA NVCC flags for: -gencode;arch=compute_70,code=sm_70;-gencode;arch=compute_80,code=sm_80 2025-05-07T19:52:27.2244756Z CMake Warning at /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:22 (message): 2025-05-07T19:52:27.2246431Z static library kineto_LIBRARY-NOTFOUND not found. 2025-05-07T19:52:27.2246818Z Call Stack (most recent call first): 2025-05-07T19:52:27.2247694Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/share/cmake/Torch/TorchConfig.cmake:125 (append_torchlib_if_found) 2025-05-07T19:52:27.2248575Z /__w/FBGEMM/FBGEMM/cmake/modules/PyTorchSetup.cmake:14 (find_package) 2025-05-07T19:52:27.2249014Z CMakeLists.txt:112 (include) 2025-05-07T19:52:27.2249190Z 2025-05-07T19:52:27.2249194Z 2025-05-07T19:52:27.2251184Z -- Found Torch: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so 2025-05-07T19:52:27.2251746Z 2025-05-07T19:52:27.2251750Z 2025-05-07T19:52:27.2251877Z ================================================================================ 2025-05-07T19:52:27.2252225Z PyTorch Flags: 2025-05-07T19:52:27.2252452Z 2025-05-07T19:52:27.2252667Z TORCH_INCLUDE_DIRS: 2025-05-07T19:52:27.2253096Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:27.2254341Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:27.2254963Z 2025-05-07T19:52:27.2255224Z TORCH_LIBRARIES: 2025-05-07T19:52:27.2255465Z torch 2025-05-07T19:52:27.2255664Z torch_library 2025-05-07T19:52:27.2256124Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:27.2256725Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:27.2257368Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:27.2257893Z 2025-05-07T19:52:27.2258112Z TORCH_CUDA_OPTIONS: 2025-05-07T19:52:27.2258374Z --expt-relaxed-constexpr 2025-05-07T19:52:27.2258652Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:27.2258965Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:27.2259264Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:27.2259621Z ================================================================================ 2025-05-07T19:52:27.2259860Z 2025-05-07T19:52:27.2259883Z 2025-05-07T19:52:27.2259887Z 2025-05-07T19:52:27.2260022Z ================================================================================ 2025-05-07T19:52:27.2260347Z NCCL Flags 2025-05-07T19:52:27.2260469Z 2025-05-07T19:52:27.2260882Z NCCL_INCLUDE_DIRS=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:27.2261794Z NCCL_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:27.2262706Z ================================================================================ 2025-05-07T19:52:27.2262945Z 2025-05-07T19:52:27.2262949Z 2025-05-07T19:52:27.2262953Z 2025-05-07T19:52:27.2263082Z ================================================================================ 2025-05-07T19:52:27.2263407Z CUDA Driver Path 2025-05-07T19:52:27.2263546Z 2025-05-07T19:52:27.2263840Z CUDA_DRIVER_LIBRARIES=/github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:27.2264342Z ================================================================================ 2025-05-07T19:52:27.2264592Z 2025-05-07T19:52:27.2264987Z -- Found NVML_LIB_PATH: /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:27.2284475Z 2025-05-07T19:52:27.2284493Z 2025-05-07T19:52:27.2285288Z ================================================================================ 2025-05-07T19:52:27.2286446Z GPU CPP Library Target: asmjit (SHARED) 2025-05-07T19:52:27.2287346Z 2025-05-07T19:52:27.2287566Z CPU_SRCS: 2025-05-07T19:52:27.2287705Z 2025-05-07T19:52:27.2287808Z 2025-05-07T19:52:27.2288016Z GPU_SRCS: 2025-05-07T19:52:27.2288132Z 2025-05-07T19:52:27.2288219Z 2025-05-07T19:52:27.2288434Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:27.2288581Z 2025-05-07T19:52:27.2288662Z 2025-05-07T19:52:27.2288873Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:27.2289016Z 2025-05-07T19:52:27.2289098Z 2025-05-07T19:52:27.2289300Z OTHER_SRCS: 2025-05-07T19:52:27.2289691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:52:27.2290335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:52:27.2290974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:52:27.2291598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:52:27.2292239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:52:27.2292848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:52:27.2293500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:52:27.2294123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:52:27.2294718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:52:27.2295446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:52:27.2296035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:52:27.2296665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:52:27.2297282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:52:27.2298043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:52:27.2298661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:52:27.2299267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:52:27.2299883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:52:27.2300610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:52:27.2301205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:52:27.2301817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:52:27.2302419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:52:27.2303045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:52:27.2303669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:52:27.2304307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:52:27.2305127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:52:27.2305720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:52:27.2306341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:52:27.2306958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:52:27.2307548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:52:27.2308250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:52:27.2308912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:52:27.2309538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:52:27.2310115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:52:27.2310705Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:52:27.2311268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:52:27.2311845Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:52:27.2312428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:52:27.2313344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:52:27.2314046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:52:27.2314634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:52:27.2315234Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:52:27.2315800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:52:27.2316387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:52:27.2316977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:52:27.2317552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:52:27.2318161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:52:27.2318756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:52:27.2319483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:52:27.2320066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:52:27.2320678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:52:27.2321290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:52:27.2321875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:52:27.2322493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:52:27.2323093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:52:27.2323685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:52:27.2324269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:52:27.2324851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:52:27.2325442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:52:27.2326019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:52:27.2326466Z 2025-05-07T19:52:27.2326654Z CC_FLAGS: 2025-05-07T19:52:27.2326792Z 2025-05-07T19:52:27.2326872Z 2025-05-07T19:52:27.2327054Z NVCC_FLAGS: 2025-05-07T19:52:27.2327190Z 2025-05-07T19:52:27.2327269Z 2025-05-07T19:52:27.2327476Z HIPCC_FLAGS: 2025-05-07T19:52:27.2327605Z 2025-05-07T19:52:27.2327684Z 2025-05-07T19:52:27.2327982Z INCLUDE_DIRS: 2025-05-07T19:52:27.2328237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:27.2328577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:27.2329067Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:27.2329383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:27.2329908Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:27.2330704Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:27.2331604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:27.2332137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:27.2332605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:27.2333106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:27.2333638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:27.2334136Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:27.2334710Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:27.2335241Z 2025-05-07T19:52:27.2335452Z Selected Source Files: 2025-05-07T19:52:27.2335641Z 2025-05-07T19:52:27.2335727Z 2025-05-07T19:52:27.2335935Z HIPified Source Files: 2025-05-07T19:52:27.2336118Z 2025-05-07T19:52:27.2336202Z 2025-05-07T19:52:27.2336428Z Library Dependencies: 2025-05-07T19:52:27.2336671Z torch 2025-05-07T19:52:27.2336892Z torch_library 2025-05-07T19:52:27.2337342Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:27.2338090Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:27.2338696Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:27.2339507Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:27.2340161Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:27.2340685Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:27.2341100Z 2025-05-07T19:52:27.2341289Z Output Library: 2025-05-07T19:52:27.2341514Z asmjit 2025-05-07T19:52:27.2341700Z 2025-05-07T19:52:27.2341907Z Destination Directory: 2025-05-07T19:52:27.2342143Z fbgemm_gpu 2025-05-07T19:52:27.2342388Z ================================================================================ 2025-05-07T19:52:27.2342622Z 2025-05-07T19:52:27.2342660Z 2025-05-07T19:52:27.2342663Z 2025-05-07T19:52:27.2342797Z ================================================================================ 2025-05-07T19:52:27.2343147Z GPU CPP Library Target: fbgemm (SHARED) 2025-05-07T19:52:27.2343456Z 2025-05-07T19:52:27.2343641Z CPU_SRCS: 2025-05-07T19:52:27.2343777Z 2025-05-07T19:52:27.2343856Z 2025-05-07T19:52:27.2344063Z GPU_SRCS: 2025-05-07T19:52:27.2344176Z 2025-05-07T19:52:27.2344255Z 2025-05-07T19:52:27.2344465Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:27.2344608Z 2025-05-07T19:52:27.2344690Z 2025-05-07T19:52:27.2344895Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:27.2345037Z 2025-05-07T19:52:27.2345115Z 2025-05-07T19:52:27.2345309Z OTHER_SRCS: 2025-05-07T19:52:27.2345575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDM.cc 2025-05-07T19:52:27.2346033Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:52:27.2346495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMNBit.cc 2025-05-07T19:52:27.2346916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtils.cc 2025-05-07T19:52:27.2347342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RefImplementations.cc 2025-05-07T19:52:27.2347820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/RowWiseSparseAdagradFused.cc 2025-05-07T19:52:27.2348291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/SparseAdagrad.cc 2025-05-07T19:52:27.2348664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/Utils.cc 2025-05-07T19:52:27.2349068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:52:27.2349643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:52:27.2350091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:52:27.2350529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/QuantUtilsAvx2.cc 2025-05-07T19:52:27.2350959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:52:27.2351341Z 2025-05-07T19:52:27.2351530Z CC_FLAGS: 2025-05-07T19:52:27.2351667Z 2025-05-07T19:52:27.2351746Z 2025-05-07T19:52:27.2351931Z NVCC_FLAGS: 2025-05-07T19:52:27.2352068Z 2025-05-07T19:52:27.2352146Z 2025-05-07T19:52:27.2352332Z HIPCC_FLAGS: 2025-05-07T19:52:27.2352646Z 2025-05-07T19:52:27.2352744Z 2025-05-07T19:52:27.2352949Z INCLUDE_DIRS: 2025-05-07T19:52:27.2353372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:27.2353712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:27.2354018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:27.2354353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:27.2354860Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:27.2355672Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:27.2356329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:27.2356763Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:27.2357214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:27.2357688Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:27.2358233Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:27.2358701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:27.2359282Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:27.2359790Z 2025-05-07T19:52:27.2360005Z Selected Source Files: 2025-05-07T19:52:27.2360164Z 2025-05-07T19:52:27.2360260Z 2025-05-07T19:52:27.2360457Z HIPified Source Files: 2025-05-07T19:52:27.2360612Z 2025-05-07T19:52:27.2360707Z 2025-05-07T19:52:27.2360902Z Library Dependencies: 2025-05-07T19:52:27.2361152Z torch 2025-05-07T19:52:27.2361348Z torch_library 2025-05-07T19:52:27.2361802Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:27.2362406Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:27.2363039Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:27.2363845Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:27.2364528Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:27.2364926Z asmjit 2025-05-07T19:52:27.2365377Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:27.2365771Z 2025-05-07T19:52:27.2365951Z Output Library: 2025-05-07T19:52:27.2366167Z fbgemm 2025-05-07T19:52:27.2366353Z 2025-05-07T19:52:27.2366552Z Destination Directory: 2025-05-07T19:52:27.2366776Z fbgemm_gpu 2025-05-07T19:52:27.2367016Z ================================================================================ 2025-05-07T19:52:27.2367239Z 2025-05-07T19:52:27.2367243Z 2025-05-07T19:52:27.2367248Z 2025-05-07T19:52:27.2367380Z ================================================================================ 2025-05-07T19:52:27.2367708Z Running code generation script ... 2025-05-07T19:52:27.2368435Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py --opensource 2025-05-07T19:52:27.2369165Z ================================================================================ 2025-05-07T19:52:27.2369405Z 2025-05-07T19:52:27.8476256Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:27.8477504Z [GENERAATE BACKWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_backward_split.py', '--opensource'] 2025-05-07T19:52:27.8478293Z Written: gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:52:27.8478787Z Written: gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:52:27.8479303Z Written: gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.8479828Z Written: gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:27.8480347Z Written: gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:52:27.8480848Z Written: gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:52:27.8481470Z Written: gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:52:27.8482000Z Written: gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.8482533Z Written: gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:27.8483063Z Written: gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:52:27.8483591Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.8484140Z Written: gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:52:27.8484802Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:27.8485381Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.8485938Z Written: gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:52:27.8486465Z Written: gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.8487117Z Written: gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:52:27.8487631Z Written: gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.8488186Z Written: gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.8488706Z Written: gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:52:27.8510330Z Written: gen_embedding_optimizer_dense_split_device_kernel.cuh 2025-05-07T19:52:27.8510930Z Written: gen_embedding_backward_split_dense.cpp 2025-05-07T19:52:27.8511369Z Written: gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:52:27.8511829Z Written: gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:52:27.8512369Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.8513207Z Written: gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:27.8513751Z Written: gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:27.8514276Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.8514843Z Written: gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:27.8515381Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:27.8515952Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:27.8516551Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:27.8517097Z Written: gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:27.8517684Z Written: gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.8518269Z Written: gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:27.8518821Z Written: gen_embedding_optimizer_adagrad_split_device_kernel.cuh 2025-05-07T19:52:27.8519292Z Written: gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:52:27.8519704Z Written: gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:52:27.8520199Z Written: gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.8520623Z Written: lookup_adagrad.py 2025-05-07T19:52:27.8520977Z Written: gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:52:27.8521404Z Written: gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:52:27.8521885Z Written: gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.8522582Z Written: gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:52:27.8523080Z Written: gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:52:27.8523587Z Written: gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.8524098Z Written: gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:27.8524607Z Written: gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:52:27.8525090Z Written: gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:52:27.8525707Z Written: gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:52:27.8526298Z Written: gen_embedding_backward_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.8526825Z Written: gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:27.8527323Z Written: gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:52:27.8527825Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.8528352Z Written: gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:27.8528878Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:27.8529563Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.8530066Z Written: gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:27.8530578Z Written: gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.8531088Z Written: gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:27.8532026Z Written: gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.8532625Z Written: gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.8533174Z Written: gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:27.8533701Z Written: gen_embedding_optimizer_adam_split_device_kernel.cuh 2025-05-07T19:52:27.8534141Z Written: gen_embedding_backward_split_adam.cpp 2025-05-07T19:52:27.8534541Z Written: gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:52:27.8535009Z Written: gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.8535413Z Written: lookup_adam.py 2025-05-07T19:52:27.8535730Z Written: gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:52:27.8536171Z Written: gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.8536662Z Written: gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:52:27.8537158Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.8537683Z Written: gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:27.8538176Z Written: gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:52:27.8538674Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.8539202Z Written: gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:27.8539703Z Written: gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:27.8540257Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:27.8540813Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:27.8541353Z Written: gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:27.8541913Z Written: gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.8542465Z Written: gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:27.8542988Z Written: gen_embedding_optimizer_lamb_split_device_kernel.cuh 2025-05-07T19:52:27.8543427Z Written: gen_embedding_backward_split_lamb.cpp 2025-05-07T19:52:27.8543829Z Written: gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:52:27.8544372Z Written: gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.8544770Z Written: lookup_lamb.py 2025-05-07T19:52:27.8545221Z Written: gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:52:27.8545658Z Written: gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.8546129Z Written: gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:52:27.8546610Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.8547123Z Written: gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:27.8547587Z Written: gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:52:27.8548098Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.8548602Z Written: gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:27.8549199Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:27.8549744Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:27.8550280Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:27.8550810Z Written: gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:27.8551339Z Written: gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.8551901Z Written: gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:27.8552395Z Written: gen_embedding_optimizer_lars_sgd_split_device_kernel.cuh 2025-05-07T19:52:27.8552936Z Written: gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:52:27.8553542Z Written: gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:52:27.8554038Z Written: gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.8554484Z Written: lookup_lars_sgd.py 2025-05-07T19:52:27.8554820Z Written: gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:52:27.8555308Z Written: gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.8555863Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:52:27.8556507Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.8557152Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:52:27.8557753Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:52:27.8558394Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.8559044Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:52:27.8559715Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:27.8560405Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:27.8561131Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:27.8561801Z Written: gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:27.8562478Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.8563198Z Written: gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:27.9543748Z Written: gen_embedding_optimizer_partial_rowwise_adam_split_device_kernel.cuh 2025-05-07T19:52:27.9545474Z Written: gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:52:27.9547047Z Written: gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:52:27.9548717Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.9549882Z Written: lookup_partial_rowwise_adam.py 2025-05-07T19:52:27.9550374Z Written: gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:52:27.9550946Z Written: gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.9551572Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:52:27.9552481Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.9553272Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:27.9553971Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:52:27.9554599Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.9555248Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:27.9555881Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:27.9556547Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:27.9557356Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:27.9558001Z Written: gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:27.9558696Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.9559522Z Written: gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:27.9560116Z Written: gen_embedding_optimizer_partial_rowwise_lamb_split_device_kernel.cuh 2025-05-07T19:52:27.9560643Z Written: gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:52:27.9561106Z Written: gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:52:27.9561643Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.9562097Z Written: lookup_partial_rowwise_lamb.py 2025-05-07T19:52:27.9562504Z Written: gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:52:27.9563039Z Written: gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.9563587Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:52:27.9564128Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:52:27.9564639Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:52:27.9565149Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:52:27.9565671Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.9566236Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:27.9566796Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:52:27.9567331Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:27.9567874Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:52:27.9568385Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:27.9568926Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:52:27.9569458Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:52:27.9569989Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_meta.cpp 2025-05-07T19:52:27.9570513Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:27.9571043Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.9571626Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:27.9572181Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:52:27.9572742Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:27.9573280Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_meta.cpp 2025-05-07T19:52:27.9573813Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:27.9574371Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.9575072Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.9575651Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:52:27.9576185Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:27.9576762Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:27.9577375Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:27.9577965Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.9578564Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:27.9579581Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:52:27.9580145Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:27.9580702Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.9581302Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.9581882Z Written: gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:52:27.9582426Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:27.9583015Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.9583620Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:27.9584236Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.9584840Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:27.9585414Z Written: gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:52:27.9585992Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:27.9586585Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:27.9587196Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:52:27.9587790Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:27.9588415Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:52:27.9589035Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:27.9589631Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:52:27.9590264Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:27.9590878Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:52:27.9591481Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:52:27.9592060Z Written: gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:52:27.9592703Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:52:27.9593490Z Written: gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:52:27.9594064Z Written: gen_embedding_optimizer_rowwise_adagrad_ssd_device_kernel.cuh 2025-05-07T19:52:27.9594642Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:52:27.9595141Z Written: gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:52:27.9595609Z Written: gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:27.9596143Z Written: gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.9596606Z Written: lookup_rowwise_adagrad_ssd.py 2025-05-07T19:52:27.9597013Z Written: gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:52:27.9597479Z Written: gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:27.9598139Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.9598599Z Written: lookup_rowwise_adagrad.py 2025-05-07T19:52:27.9599002Z Written: gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:52:27.9599585Z Written: gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:27.9600088Z Written: gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.9600663Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:52:27.9601188Z Written: gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:52:27.9601758Z Written: gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:27.9602294Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.9602854Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:27.9603405Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.9604019Z Written: gen_embedding_optimizer_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:52:27.9604632Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:27.9605182Z Written: gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:27.9605810Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:27.9606425Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:27.9607050Z Written: gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:27.9607749Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_weight_decay_split_device_kernel.cuh 2025-05-07T19:52:27.9608400Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:27.9609020Z Written: gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:27.9609691Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.0791609Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:28.0794143Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.0796269Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:52:28.0798221Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:52:28.0800240Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.0802095Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:28.0802743Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:52:28.0803369Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:52:28.0804003Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:52:28.0804651Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:28.0805308Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:28.0805962Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:52:28.0806612Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.0807290Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.0807980Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.0808915Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.0809620Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.0810290Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.0810971Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.0811651Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.0812371Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.0813150Z Written: gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.0813790Z Written: gen_embedding_optimizer_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:52:28.0814377Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:28.0814904Z Written: gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:28.0815501Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.0816025Z Written: lookup_rowwise_adagrad_with_counter.py 2025-05-07T19:52:28.0816468Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:28.0817063Z Written: gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.0817707Z Written: gen_embedding_optimizer_approx_rowwise_adagrad_with_counter_split_device_kernel.cuh 2025-05-07T19:52:28.0818341Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:28.0818911Z Written: gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:28.0819554Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.0820207Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:28.0820835Z Written: gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.0821479Z Written: gen_embedding_optimizer_rowwise_weighted_adagrad_split_device_kernel.cuh 2025-05-07T19:52:28.0822009Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:52:28.0822514Z Written: gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:52:28.0823080Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.0823633Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:52:28.0824198Z Written: gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.0824719Z Written: gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:52:28.0825167Z Written: gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:52:28.0825612Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.0826090Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:28.0826534Z Written: gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:28.0826981Z Written: gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:52:28.0827434Z Written: gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:52:28.0827886Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:28.0828381Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:28.0828835Z Written: gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:28.0829318Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.0829791Z Written: gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.0830296Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.0830909Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:28.0831822Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.0832366Z Written: gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.0832969Z Written: gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.0833575Z Written: gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.0834157Z Written: gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:28.0834735Z Written: gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.0835395Z Written: gen_embedding_optimizer_sgd_split_device_kernel.cuh 2025-05-07T19:52:28.0835837Z Written: gen_embedding_backward_split_sgd.cpp 2025-05-07T19:52:28.0836250Z Written: gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:52:28.0836686Z Written: gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.0837091Z Written: lookup_sgd.py 2025-05-07T19:52:28.0837384Z Written: gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:52:28.0837815Z Written: gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:52:28.0838264Z Written: gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.0838809Z Written: gen_embedding_optimizer_approx_sgd_split_device_kernel.cuh 2025-05-07T19:52:28.0839322Z Written: gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:52:28.0839761Z Written: gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:52:28.0840288Z Written: gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.0840762Z Written: gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:52:28.0841262Z Written: gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.0841756Z Written: gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:52:28.0842268Z Written: gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:28.0842787Z Written: gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:52:28.0843258Z Written: gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:52:28.0843772Z Written: gen_embedding_backward_none_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:28.0844276Z Written: gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:52:28.0844783Z Written: gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:52:28.0845431Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:28.0845954Z Written: gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:52:28.0846451Z Written: gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:52:28.0846953Z Written: gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:28.0847487Z Written: gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:52:28.0847954Z Written: gen_embedding_optimizer_none_split_device_kernel.cuh 2025-05-07T19:52:28.0848370Z Written: gen_embedding_backward_split_none.cpp 2025-05-07T19:52:28.0848724Z Written: gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:52:28.0849149Z Written: gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.0849534Z Written: lookup_none.py 2025-05-07T19:52:28.0849811Z Written: gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:52:28.0850230Z Written: gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.0850699Z Written: gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:52:28.0851231Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:52:28.0851767Z Written: gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:52:28.0852268Z Written: gen_embedding_backward_ssd_weighted_vbe_device_kernel.cuh 2025-05-07T19:52:28.0852758Z Written: gen_embedding_backward_split_weighted_vbe_device_kernel.cuh 2025-05-07T19:52:28.0853331Z Written: gen_embedding_backward_ssd_weighted_device_kernel.cuh 2025-05-07T19:52:28.0853803Z Written: gen_embedding_backward_split_weighted_device_kernel.cuh 2025-05-07T19:52:28.0854286Z Written: gen_embedding_backward_ssd_unweighted_nobag_device_kernel.cuh 2025-05-07T19:52:28.0854817Z Written: gen_embedding_backward_split_unweighted_nobag_device_kernel.cuh 2025-05-07T19:52:28.0855323Z Written: gen_embedding_backward_ssd_unweighted_vbe_device_kernel.cuh 2025-05-07T19:52:28.0855851Z Written: gen_embedding_backward_split_unweighted_vbe_device_kernel.cuh 2025-05-07T19:52:28.0856353Z Written: gen_embedding_backward_ssd_unweighted_device_kernel.cuh 2025-05-07T19:52:28.0856884Z Written: gen_embedding_backward_split_unweighted_device_kernel.cuh 2025-05-07T19:52:28.0857357Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:52:28.0857796Z Written: gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:52:28.0858267Z Written: gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:52:28.0858746Z Written: gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:52:28.0859243Z Written: gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:52:28.0859648Z Written: pt2_arg_utils.h 2025-05-07T19:52:28.0859889Z Written: __init__.py 2025-05-07T19:52:28.0860140Z Written: lookup_args_ssd.py 2025-05-07T19:52:28.0860394Z Written: lookup_args.py 2025-05-07T19:52:28.0899997Z 2025-05-07T19:52:28.0901690Z 2025-05-07T19:52:28.0902411Z ================================================================================ 2025-05-07T19:52:28.0903540Z Running code generation script ... 2025-05-07T19:52:28.0905941Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py --opensource 2025-05-07T19:52:28.0907342Z ================================================================================ 2025-05-07T19:52:28.0907568Z 2025-05-07T19:52:28.1969654Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:28.1970580Z [GENERATE OPTIMIZERS]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_embedding_optimizer.py', '--opensource'] 2025-05-07T19:52:28.1971355Z Written: gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:52:28.1971844Z Written: gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:52:28.1972327Z Written: gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:52:28.1972833Z Written: gen_embedding_optimizer_rowwise_adagrad_split_device_kernel.cuh 2025-05-07T19:52:28.1973342Z Written: split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T19:52:28.1973723Z Written: optimizer_args.py 2025-05-07T19:52:28.2052004Z 2025-05-07T19:52:28.2052022Z 2025-05-07T19:52:28.2052529Z ================================================================================ 2025-05-07T19:52:28.2053632Z Running code generation script ... 2025-05-07T19:52:28.2055974Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py --opensource 2025-05-07T19:52:28.2057924Z ================================================================================ 2025-05-07T19:52:28.2058148Z 2025-05-07T19:52:28.3290665Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:28.3293270Z [GENERATE FORWARD QUANTIZED]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_quantized.py', '--opensource'] 2025-05-07T19:52:28.3295878Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:52:28.3296832Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:52:28.3297584Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:52:28.3298213Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:52:28.3299050Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:52:28.3299687Z Written: gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:52:28.3300352Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:52:28.3301033Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:52:28.3301721Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:52:28.3302399Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:52:28.3303172Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:52:28.3303865Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:52:28.3304521Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:52:28.3305173Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:52:28.3305806Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:52:28.3306452Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:52:28.3307081Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:52:28.3307722Z Written: gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:52:28.3308345Z Written: gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:52:28.3308947Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:52:28.3309566Z Written: gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:52:28.3310098Z Written: gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:52:28.3310573Z Written: gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:52:28.3372408Z 2025-05-07T19:52:28.3372430Z 2025-05-07T19:52:28.3372964Z ================================================================================ 2025-05-07T19:52:28.3373915Z Running code generation script ... 2025-05-07T19:52:28.3374693Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py --opensource 2025-05-07T19:52:28.3375468Z ================================================================================ 2025-05-07T19:52:28.3375738Z 2025-05-07T19:52:28.7244258Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:28.7245735Z [GENERATE FORWARD SPLIT]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_forward_split.py', '--opensource'] 2025-05-07T19:52:28.7246504Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.7246998Z Written: gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:52:28.7247491Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.7248006Z Written: gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:52:28.7248488Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.7249103Z Written: gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.7249574Z Written: gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:52:28.7250019Z Written: gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:52:28.7250610Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.7251077Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:28.7251534Z Written: gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:52:28.7252185Z Written: gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:52:28.7252665Z Written: gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:52:28.7253135Z Written: gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:52:28.7253626Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:52:28.7254125Z Written: gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:52:28.7254594Z Written: gen_embedding_forward_dense_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:28.7255057Z Written: gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:52:28.7255607Z Written: gen_embedding_forward_dense_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:28.7256083Z Written: gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:52:28.7256533Z Written: gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:28.7256996Z Written: gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:28.7257450Z Written: gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:52:28.7257878Z Written: gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:52:28.7258336Z Written: gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:28.7258803Z Written: gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:28.7259272Z Written: gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:52:28.7259714Z Written: gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:52:28.7260157Z Written: gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:52:28.7260574Z Written: gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:52:28.7260978Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:52:28.7261422Z Written: gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:52:28.7261839Z Written: gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:52:28.7262246Z Written: gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:52:28.7262647Z Written: gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:52:28.7263057Z Written: gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:52:28.7263447Z Written: gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:52:28.7263854Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:52:28.7264302Z Written: gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:52:28.7264728Z Written: gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:52:28.7265154Z Written: gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:52:28.7265567Z Written: gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:52:28.7265966Z Written: gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:52:28.7266392Z Written: gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:52:28.7266814Z Written: gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:52:28.7267259Z Written: gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:52:28.7267697Z Written: gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:52:28.7268124Z Written: gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:52:28.7268531Z Written: gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:52:28.7268995Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:28.7269494Z Written: gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:28.7269971Z Written: gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:28.7270463Z Written: gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:28.7270904Z Written: gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.7271318Z Written: gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:52:28.7271703Z Written: gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:28.7343392Z 2025-05-07T19:52:28.7343432Z 2025-05-07T19:52:28.7344491Z ================================================================================ 2025-05-07T19:52:28.7345219Z Running code generation script ... 2025-05-07T19:52:28.7345970Z /github/home/miniconda/envs/build_binary/bin/python /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py --opensource 2025-05-07T19:52:28.7346754Z ================================================================================ 2025-05-07T19:52:28.7346981Z 2025-05-07T19:52:29.0231723Z [ARGS PARSE] Parsed arguments: Namespace(install_dir='.', is_fbcode=False, is_rocm=False) 2025-05-07T19:52:29.0234891Z [INDEX SELECT GENERATOR]: ['/__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/genscript/generate_index_select.py', '--opensource'] 2025-05-07T19:52:29.0236975Z Written: gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:52:29.0238267Z Written: gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:52:29.0238713Z Written: gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:52:29.0239154Z Written: gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:52:29.0239700Z Written: gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:52:29.0240114Z Written: gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:52:29.0240585Z Written: gen_embedding_backward_split_batch_index_select_device_kernel.cuh 2025-05-07T19:52:29.0241054Z Written: gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:52:29.0241480Z Written: gen_embedding_backward_split_common_device_kernel.cuh 2025-05-07T19:52:29.0321432Z -- Adding merge_pooled_embeddings sources 2025-05-07T19:52:29.0335427Z 2025-05-07T19:52:29.0335886Z 2025-05-07T19:52:29.0336335Z ================================================================================ 2025-05-07T19:52:29.0336766Z GPU CPP Library Target: fbgemm_gpu_tbe_cache (SHARED) 2025-05-07T19:52:29.0337117Z 2025-05-07T19:52:29.0337304Z CPU_SRCS: 2025-05-07T19:52:29.0337720Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:52:29.0338436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:52:29.0339084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:52:29.0339694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:52:29.0340316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:52:29.0340792Z 2025-05-07T19:52:29.0340988Z GPU_SRCS: 2025-05-07T19:52:29.0341332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:52:29.0341942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:52:29.0342556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:52:29.0343209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:52:29.0343833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:52:29.0344419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:52:29.0345061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:52:29.0345765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:52:29.0346352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:52:29.0346988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:52:29.0347477Z 2025-05-07T19:52:29.0347694Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.0347842Z 2025-05-07T19:52:29.0347923Z 2025-05-07T19:52:29.0348136Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.0348276Z 2025-05-07T19:52:29.0348356Z 2025-05-07T19:52:29.0348556Z OTHER_SRCS: 2025-05-07T19:52:29.0348676Z 2025-05-07T19:52:29.0348755Z 2025-05-07T19:52:29.0349238Z CC_FLAGS: 2025-05-07T19:52:29.0349361Z 2025-05-07T19:52:29.0349440Z 2025-05-07T19:52:29.0349642Z NVCC_FLAGS: 2025-05-07T19:52:29.0349882Z --expt-relaxed-constexpr 2025-05-07T19:52:29.0350161Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:29.0350460Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:29.0350752Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:29.0351024Z 2025-05-07T19:52:29.0351212Z HIPCC_FLAGS: 2025-05-07T19:52:29.0351355Z 2025-05-07T19:52:29.0351433Z 2025-05-07T19:52:29.0351612Z INCLUDE_DIRS: 2025-05-07T19:52:29.0351856Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.0352167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.0352664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.0353168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.0353750Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.0354570Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.0355227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.0355663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.0356096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.0356591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.0357113Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.0357594Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.0358176Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.0358688Z 2025-05-07T19:52:29.0358916Z Selected Source Files: 2025-05-07T19:52:29.0359347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:52:29.0360022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:52:29.0360681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:52:29.0361304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:52:29.0361936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:52:29.0362574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu 2025-05-07T19:52:29.0363184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu 2025-05-07T19:52:29.0363934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu 2025-05-07T19:52:29.0364586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu 2025-05-07T19:52:29.0365206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu 2025-05-07T19:52:29.0365789Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu 2025-05-07T19:52:29.0366424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu 2025-05-07T19:52:29.0367003Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu 2025-05-07T19:52:29.0367591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu 2025-05-07T19:52:29.0368232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu 2025-05-07T19:52:29.0368717Z 2025-05-07T19:52:29.0368931Z HIPified Source Files: 2025-05-07T19:52:29.0369113Z 2025-05-07T19:52:29.0369195Z 2025-05-07T19:52:29.0369407Z Library Dependencies: 2025-05-07T19:52:29.0369638Z torch 2025-05-07T19:52:29.0369850Z torch_library 2025-05-07T19:52:29.0370287Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.0370902Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.0371498Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.0372386Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.0373051Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.0373571Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.0373991Z 2025-05-07T19:52:29.0374184Z Output Library: 2025-05-07T19:52:29.0374430Z fbgemm_gpu_tbe_cache 2025-05-07T19:52:29.0374674Z 2025-05-07T19:52:29.0374872Z Destination Directory: 2025-05-07T19:52:29.0375128Z fbgemm_gpu 2025-05-07T19:52:29.0375362Z ================================================================================ 2025-05-07T19:52:29.0375665Z 2025-05-07T19:52:29.0844361Z 2025-05-07T19:52:29.0844455Z 2025-05-07T19:52:29.0844988Z ================================================================================ 2025-05-07T19:52:29.0846064Z GPU CPP Library Target: fbgemm_gpu_tbe_inference (SHARED) 2025-05-07T19:52:29.0846442Z 2025-05-07T19:52:29.0846665Z CPU_SRCS: 2025-05-07T19:52:29.0846999Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:52:29.0847496Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:52:29.0848022Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:52:29.0848379Z 2025-05-07T19:52:29.0848596Z GPU_SRCS: 2025-05-07T19:52:29.0848883Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:52:29.0849370Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:52:29.0849935Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:52:29.0850579Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:52:29.0851202Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:52:29.0851948Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:52:29.0852567Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:52:29.0853162Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:52:29.0853796Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:52:29.0854445Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:52:29.0855101Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:52:29.0855762Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:52:29.0856405Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:52:29.0857070Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:52:29.0857692Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:52:29.0858309Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:52:29.0858920Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:52:29.0859532Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:52:29.0860155Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:52:29.0860761Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:52:29.0861347Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:52:29.0861928Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:52:29.0862522Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:52:29.0862941Z 2025-05-07T19:52:29.0863129Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.0863270Z 2025-05-07T19:52:29.0863360Z 2025-05-07T19:52:29.0863538Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.0863676Z 2025-05-07T19:52:29.0863762Z 2025-05-07T19:52:29.0864181Z OTHER_SRCS: 2025-05-07T19:52:29.0864323Z 2025-05-07T19:52:29.0864401Z 2025-05-07T19:52:29.0864568Z CC_FLAGS: 2025-05-07T19:52:29.0864693Z 2025-05-07T19:52:29.0864770Z 2025-05-07T19:52:29.0864946Z NVCC_FLAGS: 2025-05-07T19:52:29.0865182Z --expt-relaxed-constexpr 2025-05-07T19:52:29.0865460Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:29.0865730Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:29.0866031Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:29.0866273Z 2025-05-07T19:52:29.0866466Z HIPCC_FLAGS: 2025-05-07T19:52:29.0866588Z 2025-05-07T19:52:29.0866663Z 2025-05-07T19:52:29.0866851Z INCLUDE_DIRS: 2025-05-07T19:52:29.0867166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.0867485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.0867759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.0868075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.0868570Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.0869338Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.0869967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.0870366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.0870794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.0871246Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.0871760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.0872229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.0872909Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.0873616Z 2025-05-07T19:52:29.0873798Z Selected Source Files: 2025-05-07T19:52:29.0874145Z codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:52:29.0874602Z gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:52:29.0875061Z gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:52:29.0875494Z codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:52:29.0875994Z codegen/inference/embedding_forward_quantized_split_lookup.cu 2025-05-07T19:52:29.0876550Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu 2025-05-07T19:52:29.0877156Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu 2025-05-07T19:52:29.0877773Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu 2025-05-07T19:52:29.0878376Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu 2025-05-07T19:52:29.0879003Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu 2025-05-07T19:52:29.0879730Z gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu 2025-05-07T19:52:29.0880346Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu 2025-05-07T19:52:29.0881014Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu 2025-05-07T19:52:29.0881659Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu 2025-05-07T19:52:29.0882319Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu 2025-05-07T19:52:29.0882978Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu 2025-05-07T19:52:29.0883615Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu 2025-05-07T19:52:29.0884249Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu 2025-05-07T19:52:29.0884850Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu 2025-05-07T19:52:29.0885454Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu 2025-05-07T19:52:29.0886136Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu 2025-05-07T19:52:29.0886760Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu 2025-05-07T19:52:29.0887370Z gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu 2025-05-07T19:52:29.0887945Z gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu 2025-05-07T19:52:29.0888518Z gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu 2025-05-07T19:52:29.0889100Z gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu 2025-05-07T19:52:29.0889522Z 2025-05-07T19:52:29.0889716Z HIPified Source Files: 2025-05-07T19:52:29.0889967Z 2025-05-07T19:52:29.0890045Z 2025-05-07T19:52:29.0890244Z Library Dependencies: 2025-05-07T19:52:29.0890463Z torch 2025-05-07T19:52:29.0890667Z torch_library 2025-05-07T19:52:29.0891092Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.0891683Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.0892277Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.0893070Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.0893728Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.0894092Z asmjit 2025-05-07T19:52:29.0894292Z fbgemm 2025-05-07T19:52:29.0894489Z fbgemm_gpu_tbe_cache 2025-05-07T19:52:29.0894732Z fbgemm_gpu_config 2025-05-07T19:52:29.0895072Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.0895487Z 2025-05-07T19:52:29.0895667Z Output Library: 2025-05-07T19:52:29.0895893Z fbgemm_gpu_tbe_inference 2025-05-07T19:52:29.0896119Z 2025-05-07T19:52:29.0896311Z Destination Directory: 2025-05-07T19:52:29.0896539Z fbgemm_gpu 2025-05-07T19:52:29.0896768Z ================================================================================ 2025-05-07T19:52:29.0897000Z 2025-05-07T19:52:29.3233352Z 2025-05-07T19:52:29.3233509Z 2025-05-07T19:52:29.3234054Z ================================================================================ 2025-05-07T19:52:29.3235238Z GPU CPP Library Target: fbgemm_gpu_config (SHARED) 2025-05-07T19:52:29.3236199Z 2025-05-07T19:52:29.3236718Z CPU_SRCS: 2025-05-07T19:52:29.3237370Z src/config/feature_gates.cpp 2025-05-07T19:52:29.3238100Z 2025-05-07T19:52:29.3238588Z GPU_SRCS: 2025-05-07T19:52:29.3238909Z 2025-05-07T19:52:29.3239113Z 2025-05-07T19:52:29.3239610Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3240089Z 2025-05-07T19:52:29.3240169Z 2025-05-07T19:52:29.3240367Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3240515Z 2025-05-07T19:52:29.3240597Z 2025-05-07T19:52:29.3240784Z OTHER_SRCS: 2025-05-07T19:52:29.3240902Z 2025-05-07T19:52:29.3240974Z 2025-05-07T19:52:29.3241165Z CC_FLAGS: 2025-05-07T19:52:29.3241277Z 2025-05-07T19:52:29.3241351Z 2025-05-07T19:52:29.3241527Z NVCC_FLAGS: 2025-05-07T19:52:29.3241643Z 2025-05-07T19:52:29.3241712Z 2025-05-07T19:52:29.3241905Z HIPCC_FLAGS: 2025-05-07T19:52:29.3242035Z 2025-05-07T19:52:29.3242111Z 2025-05-07T19:52:29.3242303Z INCLUDE_DIRS: 2025-05-07T19:52:29.3242539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3242857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3243152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3243453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3243962Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3244747Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3245419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3245821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3246268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3246741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3247496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3247979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3248534Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3249168Z 2025-05-07T19:52:29.3249349Z Selected Source Files: 2025-05-07T19:52:29.3249602Z src/config/feature_gates.cpp 2025-05-07T19:52:29.3249862Z 2025-05-07T19:52:29.3250222Z HIPified Source Files: 2025-05-07T19:52:29.3250372Z 2025-05-07T19:52:29.3250461Z 2025-05-07T19:52:29.3250648Z Library Dependencies: 2025-05-07T19:52:29.3251007Z torch 2025-05-07T19:52:29.3251201Z torch_library 2025-05-07T19:52:29.3251643Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3252242Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3252862Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3253663Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3254336Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3254855Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3255252Z 2025-05-07T19:52:29.3255455Z Output Library: 2025-05-07T19:52:29.3255671Z fbgemm_gpu_config 2025-05-07T19:52:29.3255892Z 2025-05-07T19:52:29.3256081Z Destination Directory: 2025-05-07T19:52:29.3256332Z fbgemm_gpu 2025-05-07T19:52:29.3256558Z ================================================================================ 2025-05-07T19:52:29.3256812Z 2025-05-07T19:52:29.3256889Z 2025-05-07T19:52:29.3256894Z 2025-05-07T19:52:29.3257022Z ================================================================================ 2025-05-07T19:52:29.3257402Z GPU CPP Library Target: fbgemm_gpu_tbe_utils (SHARED) 2025-05-07T19:52:29.3257752Z 2025-05-07T19:52:29.3257935Z CPU_SRCS: 2025-05-07T19:52:29.3258239Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:52:29.3258693Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:52:29.3259176Z 2025-05-07T19:52:29.3259362Z GPU_SRCS: 2025-05-07T19:52:29.3259617Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:52:29.3260018Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:52:29.3260406Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:52:29.3260801Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:52:29.3261213Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:52:29.3261747Z 2025-05-07T19:52:29.3261965Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3262111Z 2025-05-07T19:52:29.3262194Z 2025-05-07T19:52:29.3262409Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3262554Z 2025-05-07T19:52:29.3262647Z 2025-05-07T19:52:29.3262861Z OTHER_SRCS: 2025-05-07T19:52:29.3262984Z 2025-05-07T19:52:29.3263067Z 2025-05-07T19:52:29.3263280Z CC_FLAGS: 2025-05-07T19:52:29.3263394Z 2025-05-07T19:52:29.3263506Z 2025-05-07T19:52:29.3263702Z NVCC_FLAGS: 2025-05-07T19:52:29.3263846Z 2025-05-07T19:52:29.3263952Z 2025-05-07T19:52:29.3264149Z HIPCC_FLAGS: 2025-05-07T19:52:29.3264279Z 2025-05-07T19:52:29.3264386Z 2025-05-07T19:52:29.3264582Z INCLUDE_DIRS: 2025-05-07T19:52:29.3264842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3265165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3265479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3265802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3266327Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3267160Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3267821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3268284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3268820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3269341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3269992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3270490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3271076Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3271595Z 2025-05-07T19:52:29.3272009Z Selected Source Files: 2025-05-07T19:52:29.3272359Z src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:52:29.3273053Z src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:52:29.3273601Z src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:52:29.3274058Z src/split_embeddings_utils/generate_vbe_metadata.cu 2025-05-07T19:52:29.3274470Z src/split_embeddings_utils/get_infos_metadata.cu 2025-05-07T19:52:29.3274905Z src/split_embeddings_utils/radix_sort_pairs.cu 2025-05-07T19:52:29.3275370Z src/split_embeddings_utils/transpose_embedding_input.cu 2025-05-07T19:52:29.3275737Z 2025-05-07T19:52:29.3275956Z HIPified Source Files: 2025-05-07T19:52:29.3276116Z 2025-05-07T19:52:29.3276198Z 2025-05-07T19:52:29.3276411Z Library Dependencies: 2025-05-07T19:52:29.3276645Z torch 2025-05-07T19:52:29.3276854Z torch_library 2025-05-07T19:52:29.3277298Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3277915Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3278538Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3279340Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3280240Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3280719Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3281117Z 2025-05-07T19:52:29.3281300Z Output Library: 2025-05-07T19:52:29.3281530Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:29.3281741Z 2025-05-07T19:52:29.3281943Z Destination Directory: 2025-05-07T19:52:29.3282185Z fbgemm_gpu 2025-05-07T19:52:29.3282407Z ================================================================================ 2025-05-07T19:52:29.3282631Z 2025-05-07T19:52:29.3282636Z 2025-05-07T19:52:29.3282640Z 2025-05-07T19:52:29.3282767Z ================================================================================ 2025-05-07T19:52:29.3283163Z GPU CPP Library Target: fbgemm_gpu_sparse_async_cumsum (SHARED) 2025-05-07T19:52:29.3283538Z 2025-05-07T19:52:29.3283717Z CPU_SRCS: 2025-05-07T19:52:29.3283948Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:52:29.3284240Z 2025-05-07T19:52:29.3284414Z GPU_SRCS: 2025-05-07T19:52:29.3284643Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:52:29.3284913Z 2025-05-07T19:52:29.3285115Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3285252Z 2025-05-07T19:52:29.3285329Z 2025-05-07T19:52:29.3285529Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3285664Z 2025-05-07T19:52:29.3285741Z 2025-05-07T19:52:29.3285931Z OTHER_SRCS: 2025-05-07T19:52:29.3286048Z 2025-05-07T19:52:29.3286124Z 2025-05-07T19:52:29.3286312Z CC_FLAGS: 2025-05-07T19:52:29.3286418Z 2025-05-07T19:52:29.3286513Z 2025-05-07T19:52:29.3286683Z NVCC_FLAGS: 2025-05-07T19:52:29.3286795Z 2025-05-07T19:52:29.3286883Z 2025-05-07T19:52:29.3287056Z HIPCC_FLAGS: 2025-05-07T19:52:29.3287172Z 2025-05-07T19:52:29.3287262Z 2025-05-07T19:52:29.3287436Z INCLUDE_DIRS: 2025-05-07T19:52:29.3287672Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3287969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3288253Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3288542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3289020Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3289870Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3290486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3290886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3291290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3292010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3292703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3293188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3293834Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3294345Z 2025-05-07T19:52:29.3294563Z Selected Source Files: 2025-05-07T19:52:29.3294834Z src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:52:29.3295173Z src/sparse_ops/sparse_async_cumsum.cu 2025-05-07T19:52:29.3295455Z 2025-05-07T19:52:29.3295671Z HIPified Source Files: 2025-05-07T19:52:29.3295830Z 2025-05-07T19:52:29.3295910Z 2025-05-07T19:52:29.3296138Z Library Dependencies: 2025-05-07T19:52:29.3296373Z torch 2025-05-07T19:52:29.3296587Z torch_library 2025-05-07T19:52:29.3297049Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3297645Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3298275Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3299180Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3299859Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3300247Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:29.3300633Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3301049Z 2025-05-07T19:52:29.3301251Z Output Library: 2025-05-07T19:52:29.3301519Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:52:29.3301786Z 2025-05-07T19:52:29.3302008Z Destination Directory: 2025-05-07T19:52:29.3302251Z fbgemm_gpu 2025-05-07T19:52:29.3302509Z ================================================================================ 2025-05-07T19:52:29.3302746Z 2025-05-07T19:52:29.3302819Z 2025-05-07T19:52:29.3302823Z 2025-05-07T19:52:29.3302943Z ================================================================================ 2025-05-07T19:52:29.3303341Z GPU CPP Library Target: fbgemm_gpu_tbe_common (SHARED) 2025-05-07T19:52:29.3303679Z 2025-05-07T19:52:29.3303891Z CPU_SRCS: 2025-05-07T19:52:29.3304151Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:52:29.3304605Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:52:29.3305017Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:52:29.3305332Z 2025-05-07T19:52:29.3305524Z GPU_SRCS: 2025-05-07T19:52:29.3305766Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:52:29.3306105Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:52:29.3306468Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:52:29.3306788Z 2025-05-07T19:52:29.3306976Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3307114Z 2025-05-07T19:52:29.3307207Z 2025-05-07T19:52:29.3307393Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3307545Z 2025-05-07T19:52:29.3307623Z 2025-05-07T19:52:29.3307802Z OTHER_SRCS: 2025-05-07T19:52:29.3307935Z 2025-05-07T19:52:29.3308014Z 2025-05-07T19:52:29.3308195Z CC_FLAGS: 2025-05-07T19:52:29.3308324Z 2025-05-07T19:52:29.3308402Z 2025-05-07T19:52:29.3308583Z NVCC_FLAGS: 2025-05-07T19:52:29.3308817Z --expt-relaxed-constexpr 2025-05-07T19:52:29.3309111Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:29.3309391Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:29.3309694Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:29.3309943Z 2025-05-07T19:52:29.3310145Z HIPCC_FLAGS: 2025-05-07T19:52:29.3310268Z 2025-05-07T19:52:29.3310345Z 2025-05-07T19:52:29.3395897Z INCLUDE_DIRS: 2025-05-07T19:52:29.3396385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3396745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3397027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3397352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3397875Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3398683Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3399356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3399778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3400324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3400802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3401346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3401849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3402417Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3402980Z 2025-05-07T19:52:29.3403174Z Selected Source Files: 2025-05-07T19:52:29.3403488Z codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:52:29.3403906Z codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:52:29.3404324Z codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:52:29.3404691Z codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:52:29.3405030Z codegen/utils/embedding_bounds_check_v1.cu 2025-05-07T19:52:29.3405382Z codegen/utils/embedding_bounds_check_v2.cu 2025-05-07T19:52:29.3405678Z 2025-05-07T19:52:29.3405884Z HIPified Source Files: 2025-05-07T19:52:29.3406050Z 2025-05-07T19:52:29.3406125Z 2025-05-07T19:52:29.3406320Z Library Dependencies: 2025-05-07T19:52:29.3406552Z torch 2025-05-07T19:52:29.3406750Z torch_library 2025-05-07T19:52:29.3407194Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3407794Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3408410Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3409215Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3409875Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3410241Z fbgemm 2025-05-07T19:52:29.3410449Z fbgemm_gpu_config 2025-05-07T19:52:29.3410796Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3411206Z 2025-05-07T19:52:29.3411389Z Output Library: 2025-05-07T19:52:29.3411616Z fbgemm_gpu_tbe_common 2025-05-07T19:52:29.3411853Z 2025-05-07T19:52:29.3412037Z Destination Directory: 2025-05-07T19:52:29.3412276Z fbgemm_gpu 2025-05-07T19:52:29.3412502Z ================================================================================ 2025-05-07T19:52:29.3412751Z 2025-05-07T19:52:29.3412755Z 2025-05-07T19:52:29.3412759Z 2025-05-07T19:52:29.3412874Z ================================================================================ 2025-05-07T19:52:29.3413270Z GPU CPP Library Target: fbgemm_gpu_tbe_optimizers (SHARED) 2025-05-07T19:52:29.3413625Z 2025-05-07T19:52:29.3413811Z CPU_SRCS: 2025-05-07T19:52:29.3413922Z 2025-05-07T19:52:29.3413998Z 2025-05-07T19:52:29.3414180Z GPU_SRCS: 2025-05-07T19:52:29.3414426Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:52:29.3414831Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:52:29.3415242Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:52:29.3415585Z 2025-05-07T19:52:29.3415770Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3415919Z 2025-05-07T19:52:29.3415998Z 2025-05-07T19:52:29.3416180Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3416333Z 2025-05-07T19:52:29.3416409Z 2025-05-07T19:52:29.3416598Z OTHER_SRCS: 2025-05-07T19:52:29.3416714Z 2025-05-07T19:52:29.3418051Z 2025-05-07T19:52:29.3418260Z CC_FLAGS: 2025-05-07T19:52:29.3418373Z 2025-05-07T19:52:29.3418445Z 2025-05-07T19:52:29.3418629Z NVCC_FLAGS: 2025-05-07T19:52:29.3418843Z --expt-relaxed-constexpr 2025-05-07T19:52:29.3419119Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:29.3419390Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:29.3419701Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:29.3419948Z 2025-05-07T19:52:29.3420143Z HIPCC_FLAGS: 2025-05-07T19:52:29.3420260Z 2025-05-07T19:52:29.3420343Z 2025-05-07T19:52:29.3420517Z INCLUDE_DIRS: 2025-05-07T19:52:29.3420759Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3421140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3421427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3421727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3422243Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3423039Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3423693Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3424115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3424542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3425019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3425528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3425992Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3426546Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3427082Z 2025-05-07T19:52:29.3427299Z Selected Source Files: 2025-05-07T19:52:29.3427588Z gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:52:29.3428002Z gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu 2025-05-07T19:52:29.3428413Z gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu 2025-05-07T19:52:29.3428771Z 2025-05-07T19:52:29.3428964Z HIPified Source Files: 2025-05-07T19:52:29.3429130Z 2025-05-07T19:52:29.3429207Z 2025-05-07T19:52:29.3429399Z Library Dependencies: 2025-05-07T19:52:29.3429641Z torch 2025-05-07T19:52:29.3429830Z torch_library 2025-05-07T19:52:29.3430271Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3430869Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3431709Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3432532Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3433286Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3433806Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3434211Z 2025-05-07T19:52:29.3434405Z Output Library: 2025-05-07T19:52:29.3434658Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:52:29.3434880Z 2025-05-07T19:52:29.3435060Z Destination Directory: 2025-05-07T19:52:29.3435277Z fbgemm_gpu 2025-05-07T19:52:29.3435499Z ================================================================================ 2025-05-07T19:52:29.3435725Z 2025-05-07T19:52:29.3435729Z 2025-05-07T19:52:29.3435733Z 2025-05-07T19:52:29.3435843Z ================================================================================ 2025-05-07T19:52:29.3436250Z GPU CPP Library Target: fbgemm_gpu_tbe_training_forward (SHARED) 2025-05-07T19:52:29.3436613Z 2025-05-07T19:52:29.3436781Z CPU_SRCS: 2025-05-07T19:52:29.3437042Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3437358Z 2025-05-07T19:52:29.3437550Z GPU_SRCS: 2025-05-07T19:52:29.3437815Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:52:29.3438186Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:52:29.3438551Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:52:29.3439064Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:52:29.3439493Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:52:29.3439904Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:52:29.3440325Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:52:29.3440697Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:52:29.3441062Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:52:29.3441437Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:52:29.3441826Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:52:29.3442309Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:52:29.3442733Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:52:29.3443497Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:52:29.3443891Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:52:29.3444304Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:52:29.3444724Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:52:29.3445119Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:52:29.3445509Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:52:29.3445896Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:52:29.3446300Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:52:29.3446712Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:29.3447145Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:29.3447555Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:52:29.3447931Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:52:29.3448321Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:52:29.3448755Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:52:29.3449172Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:52:29.3449557Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:29.3449947Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:52:29.3450329Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:52:29.3450733Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:29.3451152Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:29.3451536Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:52:29.3451948Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:29.3452383Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:29.3452819Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:52:29.3453208Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:52:29.3453631Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:52:29.3454087Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:52:29.3454540Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:52:29.3454957Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:29.3455372Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:52:29.3455784Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:52:29.3456198Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:29.3456625Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:29.3457030Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:52:29.3457457Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:29.3457926Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:29.3458371Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:29.3458784Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3459145Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3459533Z 2025-05-07T19:52:29.3459723Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3459871Z 2025-05-07T19:52:29.3459943Z 2025-05-07T19:52:29.3460123Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3460266Z 2025-05-07T19:52:29.3460340Z 2025-05-07T19:52:29.3460521Z OTHER_SRCS: 2025-05-07T19:52:29.3460638Z 2025-05-07T19:52:29.3460712Z 2025-05-07T19:52:29.3460895Z CC_FLAGS: 2025-05-07T19:52:29.3461004Z 2025-05-07T19:52:29.3461078Z 2025-05-07T19:52:29.3461256Z NVCC_FLAGS: 2025-05-07T19:52:29.3461467Z --expt-relaxed-constexpr 2025-05-07T19:52:29.3461736Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:29.3462010Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:29.3462395Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:29.3462637Z 2025-05-07T19:52:29.3462826Z HIPCC_FLAGS: 2025-05-07T19:52:29.3462946Z 2025-05-07T19:52:29.3463032Z 2025-05-07T19:52:29.3463209Z INCLUDE_DIRS: 2025-05-07T19:52:29.3463442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3463742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3464032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3464332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3464829Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3465616Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3466266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3466685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3467105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3467587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3468093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3468558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3469113Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3469616Z 2025-05-07T19:52:29.3469803Z Selected Source Files: 2025-05-07T19:52:29.3470100Z gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3470498Z gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:52:29.3470901Z gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:52:29.3471321Z gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:52:29.3471722Z gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:52:29.3472126Z gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:52:29.3472519Z gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:52:29.3473054Z gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:29.3473485Z gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:52:29.3473919Z gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:29.3474370Z gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:52:29.3474780Z gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3475156Z gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3475515Z gen_embedding_forward_split_weighted_kernel.cu 2025-05-07T19:52:29.3475878Z gen_embedding_forward_dense_weighted_kernel.cu 2025-05-07T19:52:29.3476223Z gen_embedding_forward_ssd_weighted_kernel.cu 2025-05-07T19:52:29.3476605Z gen_embedding_forward_split_unweighted_nobag_kernel.cu 2025-05-07T19:52:29.3477021Z gen_embedding_forward_dense_unweighted_nobag_kernel.cu 2025-05-07T19:52:29.3477417Z gen_embedding_forward_ssd_unweighted_nobag_kernel.cu 2025-05-07T19:52:29.3477803Z gen_embedding_forward_split_unweighted_kernel.cu 2025-05-07T19:52:29.3478161Z gen_embedding_forward_dense_unweighted_kernel.cu 2025-05-07T19:52:29.3478532Z gen_embedding_forward_ssd_unweighted_kernel.cu 2025-05-07T19:52:29.3478903Z gen_embedding_forward_split_weighted_codegen_cuda.cu 2025-05-07T19:52:29.3479514Z gen_embedding_forward_split_unweighted_codegen_cuda.cu 2025-05-07T19:52:29.3479989Z gen_embedding_forward_dense_weighted_codegen_cuda.cu 2025-05-07T19:52:29.3480392Z gen_embedding_forward_dense_unweighted_codegen_cuda.cu 2025-05-07T19:52:29.3480784Z gen_embedding_forward_ssd_weighted_codegen_cuda.cu 2025-05-07T19:52:29.3481167Z gen_embedding_forward_ssd_unweighted_codegen_cuda.cu 2025-05-07T19:52:29.3481581Z gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:29.3481971Z gen_embedding_forward_split_weighted_vbe_kernel.cu 2025-05-07T19:52:29.3482348Z gen_embedding_forward_split_weighted_v2_kernel.cu 2025-05-07T19:52:29.3482738Z gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu 2025-05-07T19:52:29.3483244Z gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:52:29.3483669Z gen_embedding_forward_split_weighted_gwd_kernel.cu 2025-05-07T19:52:29.3484058Z gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:29.3484458Z gen_embedding_forward_dense_weighted_vbe_kernel.cu 2025-05-07T19:52:29.3484847Z gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu 2025-05-07T19:52:29.3485254Z gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu 2025-05-07T19:52:29.3485628Z gen_embedding_forward_ssd_weighted_vbe_kernel.cu 2025-05-07T19:52:29.3486037Z gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:29.3486447Z gen_embedding_forward_split_unweighted_vbe_kernel.cu 2025-05-07T19:52:29.3486837Z gen_embedding_forward_split_unweighted_v2_kernel.cu 2025-05-07T19:52:29.3487259Z gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu 2025-05-07T19:52:29.3487706Z gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu 2025-05-07T19:52:29.3488152Z gen_embedding_forward_split_unweighted_gwd_kernel.cu 2025-05-07T19:52:29.3488559Z gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:29.3488974Z gen_embedding_forward_dense_unweighted_vbe_kernel.cu 2025-05-07T19:52:29.3489376Z gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu 2025-05-07T19:52:29.3489798Z gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu 2025-05-07T19:52:29.3490200Z gen_embedding_forward_ssd_unweighted_vbe_kernel.cu 2025-05-07T19:52:29.3490737Z gen_embedding_forward_split_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:29.3491205Z gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:29.3491654Z gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu 2025-05-07T19:52:29.3492002Z 2025-05-07T19:52:29.3492195Z HIPified Source Files: 2025-05-07T19:52:29.3492357Z 2025-05-07T19:52:29.3492458Z 2025-05-07T19:52:29.3492641Z Library Dependencies: 2025-05-07T19:52:29.3492873Z torch 2025-05-07T19:52:29.3493060Z torch_library 2025-05-07T19:52:29.3493498Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3494096Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3494685Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3495485Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3496148Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3496591Z fbgemm_gpu_tbe_common 2025-05-07T19:52:29.3496938Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3497333Z 2025-05-07T19:52:29.3497534Z Output Library: 2025-05-07T19:52:29.3497758Z fbgemm_gpu_tbe_training_forward 2025-05-07T19:52:29.3498013Z 2025-05-07T19:52:29.3498199Z Destination Directory: 2025-05-07T19:52:29.3498433Z fbgemm_gpu 2025-05-07T19:52:29.3498659Z ================================================================================ 2025-05-07T19:52:29.3498903Z 2025-05-07T19:52:29.3498907Z 2025-05-07T19:52:29.3498911Z 2025-05-07T19:52:29.3499021Z ================================================================================ 2025-05-07T19:52:29.3499461Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_pt2 (SHARED) 2025-05-07T19:52:29.3499925Z 2025-05-07T19:52:29.3500119Z CPU_SRCS: 2025-05-07T19:52:29.3500349Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:52:29.3500733Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:29.3501093Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:52:29.3501429Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:52:29.3501752Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:52:29.3502094Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:52:29.3502480Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:52:29.3502922Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:52:29.3503370Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:52:29.3503770Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:29.3504211Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:52:29.3504616Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:29.3505128Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:29.3505697Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:29.3506276Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:29.3506788Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:52:29.3507213Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:29.3507624Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3508074Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3508535Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3508930Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3509338Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3509755Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3510233Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3510776Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3511241Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3511741Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3512262Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3512866Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3513481Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3514153Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3514824Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3515419Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3515844Z 2025-05-07T19:52:29.3516023Z GPU_SRCS: 2025-05-07T19:52:29.3516314Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3516784Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3517231Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3517643Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3518048Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3518486Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3518972Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3519537Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3520039Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3520538Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3521153Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3521658Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3522279Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3522950Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3523635Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3524250Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3524781Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3525333Z 2025-05-07T19:52:29.3525505Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3525644Z 2025-05-07T19:52:29.3525712Z 2025-05-07T19:52:29.3525882Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3526016Z 2025-05-07T19:52:29.3526082Z 2025-05-07T19:52:29.3526242Z OTHER_SRCS: 2025-05-07T19:52:29.3526361Z 2025-05-07T19:52:29.3526427Z 2025-05-07T19:52:29.3526603Z CC_FLAGS: 2025-05-07T19:52:29.3526706Z 2025-05-07T19:52:29.3526775Z 2025-05-07T19:52:29.3526936Z NVCC_FLAGS: 2025-05-07T19:52:29.3527123Z --expt-relaxed-constexpr 2025-05-07T19:52:29.3527374Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:29.3527625Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:29.3527900Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:29.3528117Z 2025-05-07T19:52:29.3528291Z HIPCC_FLAGS: 2025-05-07T19:52:29.3528399Z 2025-05-07T19:52:29.3528464Z 2025-05-07T19:52:29.3528634Z INCLUDE_DIRS: 2025-05-07T19:52:29.3528843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3529127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3529387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3529665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3530114Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3530836Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3531766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3532349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3532778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3533243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3533756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3534212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3534760Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3535264Z 2025-05-07T19:52:29.3535445Z Selected Source Files: 2025-05-07T19:52:29.3535718Z gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:52:29.3536084Z gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:29.3536453Z gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:52:29.3536789Z gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:52:29.3537108Z gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:52:29.3537444Z gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:52:29.3537827Z gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:52:29.3538258Z gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:52:29.3538630Z gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:52:29.3539050Z gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:29.3539477Z gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:52:29.3539887Z gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:29.3540407Z gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:52:29.3540972Z gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:29.3541543Z gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:52:29.3542189Z gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:52:29.3542635Z gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:52:29.3543044Z gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3543527Z gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3543997Z gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3544486Z gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3544865Z gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3545244Z gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3545773Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3546273Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3546708Z gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3547170Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3547653Z gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3548117Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3548666Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3549286Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3549898Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3550443Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:52:29.3550914Z gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3551345Z gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3551770Z gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3552140Z gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3552523Z gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3553196Z gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3553681Z gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3554225Z gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3554697Z gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3555199Z gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3555728Z gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3556243Z gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3556847Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3557511Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3558174Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3558768Z gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3559300Z gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:52:29.3559662Z 2025-05-07T19:52:29.3559865Z HIPified Source Files: 2025-05-07T19:52:29.3560017Z 2025-05-07T19:52:29.3560098Z 2025-05-07T19:52:29.3560274Z Library Dependencies: 2025-05-07T19:52:29.3560497Z torch 2025-05-07T19:52:29.3560673Z torch_library 2025-05-07T19:52:29.3561104Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3561686Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3562283Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3563073Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3563795Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3564165Z fbgemm 2025-05-07T19:52:29.3564344Z fbgemm_gpu_config 2025-05-07T19:52:29.3564561Z fbgemm_gpu_tbe_cache 2025-05-07T19:52:29.3564779Z fbgemm_gpu_tbe_common 2025-05-07T19:52:29.3565009Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:29.3565236Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:52:29.3565721Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3566075Z 2025-05-07T19:52:29.3566243Z Output Library: 2025-05-07T19:52:29.3566452Z fbgemm_gpu_tbe_training_backward_pt2 2025-05-07T19:52:29.3566703Z 2025-05-07T19:52:29.3566937Z Destination Directory: 2025-05-07T19:52:29.3567141Z fbgemm_gpu 2025-05-07T19:52:29.3567347Z ================================================================================ 2025-05-07T19:52:29.3567557Z 2025-05-07T19:52:29.3567561Z 2025-05-07T19:52:29.3567564Z 2025-05-07T19:52:29.3567665Z ================================================================================ 2025-05-07T19:52:29.3568055Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward (SHARED) 2025-05-07T19:52:29.3568394Z 2025-05-07T19:52:29.3568560Z CPU_SRCS: 2025-05-07T19:52:29.3568852Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:52:29.3569238Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:52:29.3569558Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:52:29.3569893Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:29.3570232Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:52:29.3570523Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:52:29.3570831Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:52:29.3571139Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:52:29.3571497Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:52:29.3571893Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:52:29.3572237Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:52:29.3572625Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:29.3573020Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:52:29.3573400Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:29.3573861Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:29.3574391Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:29.3574918Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:29.3575381Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:52:29.3575770Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:52:29.3576105Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:52:29.3576446Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:52:29.3576704Z 2025-05-07T19:52:29.3576873Z GPU_SRCS: 2025-05-07T19:52:29.3577095Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:52:29.3577478Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:52:29.3577884Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:52:29.3578294Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:52:29.3578692Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:52:29.3579115Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:52:29.3579579Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:52:29.3580039Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3580693Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3581207Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3581697Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:52:29.3582209Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3582688Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3583121Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:52:29.3583507Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3583933Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3584357Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3584815Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3585306Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3585791Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:29.3586204Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3586634Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3587077Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:52:29.3587531Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3588015Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3588509Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3589016Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3589553Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3590049Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:29.3590518Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3591009Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3591442Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:52:29.3591806Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3592190Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3592664Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3593265Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3593771Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3594201Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:29.3594610Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3595039Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3595438Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:52:29.3595846Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3596259Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3596691Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3597143Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3597644Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3598079Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:52:29.3598490Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3598929Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3599337Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:52:29.3599735Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3600147Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3600572Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3601026Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3601515Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3601957Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:29.3602363Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3602871Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3603291Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:52:29.3603717Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3604169Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3604634Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3605132Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3605747Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3606265Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:29.3606688Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3607150Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3607612Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:52:29.3608136Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3608683Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3609215Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3609786Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3610373Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3610935Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:52:29.3611449Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3611997Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3612520Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:52:29.3613012Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3613552Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3614074Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3614639Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3615218Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3615769Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:29.3616293Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3616834Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3617304Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:52:29.3617687Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3618107Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3618521Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3618972Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3619450Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3619869Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:52:29.3620273Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3620692Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3621180Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:52:29.3621730Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3622324Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3622921Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3623608Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3624265Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3624857Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:52:29.3625439Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3626045Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3626472Z 2025-05-07T19:52:29.3626667Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3626802Z 2025-05-07T19:52:29.3626878Z 2025-05-07T19:52:29.3627136Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3627462Z gen_embedding_backward_split_unweighted_nobag_device_kernel_hip.hip 2025-05-07T19:52:29.3627936Z gen_embedding_backward_split_weighted_device_kernel_hip.hip 2025-05-07T19:52:29.3628371Z gen_embedding_backward_split_unweighted_device_kernel_hip.hip 2025-05-07T19:52:29.3628728Z 2025-05-07T19:52:29.3628899Z OTHER_SRCS: 2025-05-07T19:52:29.3629032Z 2025-05-07T19:52:29.3629105Z 2025-05-07T19:52:29.3629298Z CC_FLAGS: 2025-05-07T19:52:29.3629403Z 2025-05-07T19:52:29.3629475Z 2025-05-07T19:52:29.3629664Z NVCC_FLAGS: 2025-05-07T19:52:29.3629866Z --expt-relaxed-constexpr 2025-05-07T19:52:29.3630107Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:29.3630352Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:29.3630615Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:29.3630836Z 2025-05-07T19:52:29.3631000Z HIPCC_FLAGS: 2025-05-07T19:52:29.3631108Z 2025-05-07T19:52:29.3631172Z 2025-05-07T19:52:29.3631485Z INCLUDE_DIRS: 2025-05-07T19:52:29.3631968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3632270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3632550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3632914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3633401Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3634183Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3634831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3635235Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3635664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3636137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3636645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3637100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3637649Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3638153Z 2025-05-07T19:52:29.3638333Z Selected Source Files: 2025-05-07T19:52:29.3638687Z codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:52:29.3639120Z gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:52:29.3639457Z gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:52:29.3639830Z gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:29.3640183Z gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:52:29.3640501Z gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:52:29.3640818Z gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:52:29.3641151Z gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:52:29.3641529Z gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:52:29.3641954Z gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:52:29.3642333Z gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:52:29.3642735Z gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:29.3643162Z gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:52:29.3643558Z gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:52:29.3644062Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:52:29.3644753Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:29.3645418Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:52:29.3645890Z gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:52:29.3646266Z gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:52:29.3646611Z gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:52:29.3646938Z gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:52:29.3647256Z gen_embedding_backward_split_grad_embedding_ops.cu 2025-05-07T19:52:29.3647629Z gen_embedding_backward_split_indice_weights_codegen_cuda.cu 2025-05-07T19:52:29.3648110Z gen_embedding_backward_dense_indice_weights_codegen_cuda.cu 2025-05-07T19:52:29.3648689Z gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu 2025-05-07T19:52:29.3649114Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu 2025-05-07T19:52:29.3649571Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu 2025-05-07T19:52:29.3650055Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu 2025-05-07T19:52:29.3650551Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3651066Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3651613Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3652112Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu 2025-05-07T19:52:29.3652587Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3653091Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3653534Z gen_embedding_backward_adagrad_split_weighted_cuda.cu 2025-05-07T19:52:29.3653946Z gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3654377Z gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3654829Z gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3655293Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3655799Z gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3656256Z gen_embedding_backward_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:29.3656674Z gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3657134Z gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3657583Z gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu 2025-05-07T19:52:29.3658057Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3658552Z gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3659061Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3659599Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3660156Z gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3660677Z gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu 2025-05-07T19:52:29.3661159Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3661678Z gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3662119Z gen_embedding_backward_sgd_split_weighted_cuda.cu 2025-05-07T19:52:29.3662503Z gen_embedding_backward_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3662912Z gen_embedding_backward_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3663322Z gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3663763Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3664229Z gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3664751Z gen_embedding_backward_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:29.3665173Z gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3665576Z gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3665946Z gen_embedding_backward_adam_split_weighted_cuda.cu 2025-05-07T19:52:29.3666313Z gen_embedding_backward_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3666705Z gen_embedding_backward_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3667096Z gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3667520Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3667968Z gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3669097Z gen_embedding_backward_adam_split_unweighted_cuda.cu 2025-05-07T19:52:29.3669476Z gen_embedding_backward_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3669883Z gen_embedding_backward_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3670258Z gen_embedding_backward_lamb_split_weighted_cuda.cu 2025-05-07T19:52:29.3670621Z gen_embedding_backward_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3671014Z gen_embedding_backward_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3671400Z gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3671821Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3672266Z gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3672766Z gen_embedding_backward_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:29.3673340Z gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3673775Z gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3674216Z gen_embedding_backward_lars_sgd_split_weighted_cuda.cu 2025-05-07T19:52:29.3674634Z gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3675090Z gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3675550Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3676048Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3676569Z gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3677037Z gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu 2025-05-07T19:52:29.3677485Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3677954Z gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3678445Z gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu 2025-05-07T19:52:29.3678966Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3679523Z gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3680080Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3680666Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3681290Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3682043Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu 2025-05-07T19:52:29.3682581Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3683137Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3683688Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu 2025-05-07T19:52:29.3684211Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3684753Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3685427Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3685960Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3686529Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3687126Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu 2025-05-07T19:52:29.3687628Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3688155Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3688596Z gen_embedding_backward_none_split_weighted_cuda.cu 2025-05-07T19:52:29.3688976Z gen_embedding_backward_none_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3689360Z gen_embedding_backward_none_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3689756Z gen_embedding_backward_none_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3690249Z gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3690704Z gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3691118Z gen_embedding_backward_none_split_unweighted_cuda.cu 2025-05-07T19:52:29.3691502Z gen_embedding_backward_none_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3691914Z gen_embedding_backward_none_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3692366Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu 2025-05-07T19:52:29.3692916Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3693477Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3694047Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3694642Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3695269Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3695852Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu 2025-05-07T19:52:29.3696402Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3696990Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3697400Z 2025-05-07T19:52:29.3697574Z HIPified Source Files: 2025-05-07T19:52:29.3697714Z 2025-05-07T19:52:29.3697787Z 2025-05-07T19:52:29.3697952Z Library Dependencies: 2025-05-07T19:52:29.3698155Z torch 2025-05-07T19:52:29.3698321Z torch_library 2025-05-07T19:52:29.3698725Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3699263Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3699813Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3700535Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3701136Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3701475Z fbgemm 2025-05-07T19:52:29.3701645Z fbgemm_gpu_config 2025-05-07T19:52:29.3701842Z fbgemm_gpu_tbe_cache 2025-05-07T19:52:29.3702044Z fbgemm_gpu_tbe_common 2025-05-07T19:52:29.3702253Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:29.3702467Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:52:29.3702823Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3703175Z 2025-05-07T19:52:29.3703346Z Output Library: 2025-05-07T19:52:29.3703548Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:52:29.3703779Z 2025-05-07T19:52:29.3703949Z Destination Directory: 2025-05-07T19:52:29.3704149Z fbgemm_gpu 2025-05-07T19:52:29.3704359Z ================================================================================ 2025-05-07T19:52:29.3704572Z 2025-05-07T19:52:29.3704579Z 2025-05-07T19:52:29.3704583Z 2025-05-07T19:52:29.3704680Z ================================================================================ 2025-05-07T19:52:29.3705074Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_gwd (SHARED) 2025-05-07T19:52:29.3705422Z 2025-05-07T19:52:29.3705591Z CPU_SRCS: 2025-05-07T19:52:29.3705689Z 2025-05-07T19:52:29.3705820Z 2025-05-07T19:52:29.3705974Z GPU_SRCS: 2025-05-07T19:52:29.3706254Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:52:29.3706729Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:52:29.3707241Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:52:29.3707728Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:52:29.3708224Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:52:29.3708737Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:52:29.3709295Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:52:29.3709804Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:29.3710330Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:29.3710857Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:52:29.3711371Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:29.3711925Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:29.3712322Z 2025-05-07T19:52:29.3712485Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3712684Z 2025-05-07T19:52:29.3712758Z 2025-05-07T19:52:29.3712915Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3713039Z 2025-05-07T19:52:29.3713110Z 2025-05-07T19:52:29.3713446Z OTHER_SRCS: 2025-05-07T19:52:29.3713569Z 2025-05-07T19:52:29.3713639Z 2025-05-07T19:52:29.3713801Z CC_FLAGS: 2025-05-07T19:52:29.3713963Z 2025-05-07T19:52:29.3714035Z 2025-05-07T19:52:29.3714208Z NVCC_FLAGS: 2025-05-07T19:52:29.3714418Z --expt-relaxed-constexpr 2025-05-07T19:52:29.3714688Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:29.3714952Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:29.3715243Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:29.3715483Z 2025-05-07T19:52:29.3715665Z HIPCC_FLAGS: 2025-05-07T19:52:29.3715786Z 2025-05-07T19:52:29.3715857Z 2025-05-07T19:52:29.3716037Z INCLUDE_DIRS: 2025-05-07T19:52:29.3716255Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3716564Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3716832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3717137Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3717623Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3718401Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3719048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3719447Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3719873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3720331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3720848Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3721304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3721846Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3722343Z 2025-05-07T19:52:29.3722525Z Selected Source Files: 2025-05-07T19:52:29.3722873Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu 2025-05-07T19:52:29.3723389Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu 2025-05-07T19:52:29.3723940Z gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu 2025-05-07T19:52:29.3724472Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu 2025-05-07T19:52:29.3725002Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu 2025-05-07T19:52:29.3725677Z gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu 2025-05-07T19:52:29.3726253Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu 2025-05-07T19:52:29.3726761Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:29.3727289Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:29.3727807Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu 2025-05-07T19:52:29.3728324Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu 2025-05-07T19:52:29.3728553Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu 2025-05-07T19:52:29.3728623Z 2025-05-07T19:52:29.3728705Z HIPified Source Files: 2025-05-07T19:52:29.3728770Z 2025-05-07T19:52:29.3728850Z 2025-05-07T19:52:29.3728929Z Library Dependencies: 2025-05-07T19:52:29.3728996Z torch 2025-05-07T19:52:29.3729072Z torch_library 2025-05-07T19:52:29.3729357Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3729513Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3729813Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3730139Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3730315Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3730407Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:52:29.3730604Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3730672Z 2025-05-07T19:52:29.3730748Z Output Library: 2025-05-07T19:52:29.3730848Z fbgemm_gpu_tbe_training_backward_gwd 2025-05-07T19:52:29.3730919Z 2025-05-07T19:52:29.3730998Z Destination Directory: 2025-05-07T19:52:29.3731067Z fbgemm_gpu 2025-05-07T19:52:29.3731177Z ================================================================================ 2025-05-07T19:52:29.3731182Z 2025-05-07T19:52:29.3731185Z 2025-05-07T19:52:29.3731189Z 2025-05-07T19:52:29.3731461Z ================================================================================ 2025-05-07T19:52:29.3731648Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_vbe (SHARED) 2025-05-07T19:52:29.3731721Z 2025-05-07T19:52:29.3732061Z CPU_SRCS: 2025-05-07T19:52:29.3732065Z 2025-05-07T19:52:29.3732136Z 2025-05-07T19:52:29.3732218Z GPU_SRCS: 2025-05-07T19:52:29.3732417Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:52:29.3732610Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:52:29.3732812Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:29.3733021Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:52:29.3733266Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:52:29.3733517Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:29.3733677Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:52:29.3733836Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:29.3733989Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:52:29.3734160Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:29.3734310Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:52:29.3734466Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:29.3734653Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:52:29.3734876Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3735099Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3735280Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:52:29.3735493Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3735703Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3736014Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:29.3736248Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3736473Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3736667Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:52:29.3736893Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3737113Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3737357Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:52:29.3737689Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3737962Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3738216Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:29.3738484Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3738762Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3738911Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:52:29.3739078Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3739256Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3739404Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:29.3739590Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3739781Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3739929Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:52:29.3740104Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3740290Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3740457Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:29.3740643Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3740833Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3740989Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:52:29.3741158Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3741334Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3741500Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:29.3741683Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3741864Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3741936Z 2025-05-07T19:52:29.3742031Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3742036Z 2025-05-07T19:52:29.3742108Z 2025-05-07T19:52:29.3742195Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3742199Z 2025-05-07T19:52:29.3742276Z 2025-05-07T19:52:29.3742353Z OTHER_SRCS: 2025-05-07T19:52:29.3742357Z 2025-05-07T19:52:29.3742426Z 2025-05-07T19:52:29.3742500Z CC_FLAGS: 2025-05-07T19:52:29.3742513Z 2025-05-07T19:52:29.3742580Z 2025-05-07T19:52:29.3742655Z NVCC_FLAGS: 2025-05-07T19:52:29.3742753Z --expt-relaxed-constexpr 2025-05-07T19:52:29.3742853Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:29.3742949Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:29.3743038Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:29.3743108Z 2025-05-07T19:52:29.3743191Z HIPCC_FLAGS: 2025-05-07T19:52:29.3743199Z 2025-05-07T19:52:29.3743269Z 2025-05-07T19:52:29.3743346Z INCLUDE_DIRS: 2025-05-07T19:52:29.3743460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3743553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3743653Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3743760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3744198Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3744560Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3744689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3744840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3744981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3745165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3745356Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3745561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3745841Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3745914Z 2025-05-07T19:52:29.3745996Z Selected Source Files: 2025-05-07T19:52:29.3746184Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:52:29.3746358Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:52:29.3746555Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:29.3746733Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:52:29.3746959Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:52:29.3747196Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:29.3747332Z gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:52:29.3747480Z gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:29.3761473Z gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:52:29.3761735Z gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:29.3761916Z gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:52:29.3762092Z gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:52:29.3762290Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu 2025-05-07T19:52:29.3762515Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3762743Z gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3762924Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu 2025-05-07T19:52:29.3763130Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3763352Z gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3763557Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:29.3763782Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3764011Z gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3764202Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu 2025-05-07T19:52:29.3764422Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3764644Z gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3764885Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu 2025-05-07T19:52:29.3765145Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3765528Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3765779Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:29.3766042Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3766302Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3766447Z gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu 2025-05-07T19:52:29.3766745Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3766911Z gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3767071Z gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:29.3767243Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3767419Z gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3767565Z gen_embedding_backward_dense_split_weighted_vbe_cuda.cu 2025-05-07T19:52:29.3767740Z gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3767916Z gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3768132Z gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:29.3768318Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3768499Z gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3768639Z gen_embedding_backward_adam_split_weighted_vbe_cuda.cu 2025-05-07T19:52:29.3768821Z gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3768990Z gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3769139Z gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu 2025-05-07T19:52:29.3769311Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu 2025-05-07T19:52:29.3769494Z gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu 2025-05-07T19:52:29.3769567Z 2025-05-07T19:52:29.3769655Z HIPified Source Files: 2025-05-07T19:52:29.3769662Z 2025-05-07T19:52:29.3769747Z 2025-05-07T19:52:29.3769833Z Library Dependencies: 2025-05-07T19:52:29.3769909Z torch 2025-05-07T19:52:29.3769987Z torch_library 2025-05-07T19:52:29.3770293Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3770452Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3770776Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3771119Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3771298Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3771399Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:52:29.3771611Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3771677Z 2025-05-07T19:52:29.3771755Z Output Library: 2025-05-07T19:52:29.3771864Z fbgemm_gpu_tbe_training_backward_vbe 2025-05-07T19:52:29.3771929Z 2025-05-07T19:52:29.3772016Z Destination Directory: 2025-05-07T19:52:29.3772093Z fbgemm_gpu 2025-05-07T19:52:29.3772207Z ================================================================================ 2025-05-07T19:52:29.3772212Z 2025-05-07T19:52:29.3772217Z 2025-05-07T19:52:29.3772220Z 2025-05-07T19:52:29.3772323Z ================================================================================ 2025-05-07T19:52:29.3772536Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_dense (SHARED) 2025-05-07T19:52:29.3772610Z 2025-05-07T19:52:29.3772681Z CPU_SRCS: 2025-05-07T19:52:29.3772686Z 2025-05-07T19:52:29.3772754Z 2025-05-07T19:52:29.3772825Z GPU_SRCS: 2025-05-07T19:52:29.3772960Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:52:29.3773101Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:52:29.3773259Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3773422Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3773696Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3773856Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:29.3774044Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3774223Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3774359Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:52:29.3774575Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:52:29.3774734Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3774892Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3774993Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:52:29.3775069Z 2025-05-07T19:52:29.3775148Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3775153Z 2025-05-07T19:52:29.3775218Z 2025-05-07T19:52:29.3775300Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3775304Z 2025-05-07T19:52:29.3775369Z 2025-05-07T19:52:29.3775440Z OTHER_SRCS: 2025-05-07T19:52:29.3775444Z 2025-05-07T19:52:29.3775562Z 2025-05-07T19:52:29.3775637Z CC_FLAGS: 2025-05-07T19:52:29.3775640Z 2025-05-07T19:52:29.3775704Z 2025-05-07T19:52:29.3775776Z NVCC_FLAGS: 2025-05-07T19:52:29.3775871Z --expt-relaxed-constexpr 2025-05-07T19:52:29.3775958Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:29.3776051Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:29.3776140Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:29.3776210Z 2025-05-07T19:52:29.3776283Z HIPCC_FLAGS: 2025-05-07T19:52:29.3776288Z 2025-05-07T19:52:29.3776353Z 2025-05-07T19:52:29.3776435Z INCLUDE_DIRS: 2025-05-07T19:52:29.3776536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3776628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3776733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3776830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3777091Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3777458Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3777603Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3777752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3777894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3778092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3778279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3778412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3778704Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3778770Z 2025-05-07T19:52:29.3779002Z Selected Source Files: 2025-05-07T19:52:29.3779139Z gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:52:29.3779307Z gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T19:52:29.3779445Z gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T19:52:29.3779549Z gen_embedding_backward_split_dense.cpp 2025-05-07T19:52:29.3779684Z gen_embedding_backward_dense_split_weighted_cuda.cu 2025-05-07T19:52:29.3779829Z gen_embedding_backward_dense_split_weighted_kernel_cta.cu 2025-05-07T19:52:29.3779981Z gen_embedding_backward_dense_split_weighted_kernel_warp.cu 2025-05-07T19:52:29.3780142Z gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu 2025-05-07T19:52:29.3780331Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu 2025-05-07T19:52:29.3780511Z gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu 2025-05-07T19:52:29.3780645Z gen_embedding_backward_dense_split_unweighted_cuda.cu 2025-05-07T19:52:29.3780809Z gen_embedding_backward_dense_split_unweighted_kernel_cta.cu 2025-05-07T19:52:29.3780966Z gen_embedding_backward_dense_split_unweighted_kernel_warp.cu 2025-05-07T19:52:29.3781030Z 2025-05-07T19:52:29.3781123Z HIPified Source Files: 2025-05-07T19:52:29.3781127Z 2025-05-07T19:52:29.3781194Z 2025-05-07T19:52:29.3781276Z Library Dependencies: 2025-05-07T19:52:29.3781344Z torch 2025-05-07T19:52:29.3781423Z torch_library 2025-05-07T19:52:29.3781706Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3781856Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3782219Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3782542Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3782712Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3782811Z fbgemm_gpu_tbe_training_backward 2025-05-07T19:52:29.3783000Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3783066Z 2025-05-07T19:52:29.3783140Z Output Library: 2025-05-07T19:52:29.3783246Z fbgemm_gpu_tbe_training_backward_dense 2025-05-07T19:52:29.3783365Z 2025-05-07T19:52:29.3783447Z Destination Directory: 2025-05-07T19:52:29.3783523Z fbgemm_gpu 2025-05-07T19:52:29.3783624Z ================================================================================ 2025-05-07T19:52:29.3783629Z 2025-05-07T19:52:29.3783632Z 2025-05-07T19:52:29.3783636Z 2025-05-07T19:52:29.3783740Z ================================================================================ 2025-05-07T19:52:29.3783953Z GPU CPP Library Target: fbgemm_gpu_tbe_training_backward_split_host (SHARED) 2025-05-07T19:52:29.3784017Z 2025-05-07T19:52:29.3784086Z CPU_SRCS: 2025-05-07T19:52:29.3784090Z 2025-05-07T19:52:29.3784157Z 2025-05-07T19:52:29.3784229Z GPU_SRCS: 2025-05-07T19:52:29.3784333Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:52:29.3784455Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:52:29.3784560Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:52:29.3784656Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:52:29.3784749Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:52:29.3784851Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:52:29.3784993Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:52:29.3785128Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:52:29.3785226Z gen_embedding_backward_split_none.cpp 2025-05-07T19:52:29.3785405Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:29.3785511Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:52:29.3785651Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:52:29.3785848Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:29.3786053Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:29.3786232Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:29.3786380Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:52:29.3786502Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:52:29.3786646Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:29.3786791Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:29.3786966Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:29.3787143Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:29.3787274Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:52:29.3787411Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:29.3787539Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:52:29.3787675Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:52:29.3787798Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:52:29.3787942Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:29.3788081Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:52:29.3788229Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:29.3788424Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:52:29.3788616Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:52:29.3788798Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:52:29.3788993Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:29.3789189Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:52:29.3789329Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:52:29.3789539Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:52:29.3789771Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:52:29.3789846Z 2025-05-07T19:52:29.3789934Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3789938Z 2025-05-07T19:52:29.3790016Z 2025-05-07T19:52:29.3790095Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3790099Z 2025-05-07T19:52:29.3790163Z 2025-05-07T19:52:29.3790307Z OTHER_SRCS: 2025-05-07T19:52:29.3790311Z 2025-05-07T19:52:29.3790376Z 2025-05-07T19:52:29.3790446Z CC_FLAGS: 2025-05-07T19:52:29.3790449Z 2025-05-07T19:52:29.3790516Z 2025-05-07T19:52:29.3790600Z NVCC_FLAGS: 2025-05-07T19:52:29.3790691Z --expt-relaxed-constexpr 2025-05-07T19:52:29.3790782Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:29.3790892Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:29.3790978Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:29.3791046Z 2025-05-07T19:52:29.3791126Z HIPCC_FLAGS: 2025-05-07T19:52:29.3791130Z 2025-05-07T19:52:29.3791206Z 2025-05-07T19:52:29.3791279Z INCLUDE_DIRS: 2025-05-07T19:52:29.3791379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3791471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3791562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3791659Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3791916Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3792288Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3792416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3792650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3792824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3793187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3793388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3793545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3793847Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3793932Z 2025-05-07T19:52:29.3794019Z Selected Source Files: 2025-05-07T19:52:29.3794130Z gen_embedding_backward_split_adagrad.cpp 2025-05-07T19:52:29.3794261Z gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T19:52:29.3794376Z gen_embedding_backward_split_sgd.cpp 2025-05-07T19:52:29.3794481Z gen_embedding_backward_split_adam.cpp 2025-05-07T19:52:29.3794584Z gen_embedding_backward_split_lamb.cpp 2025-05-07T19:52:29.3794705Z gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T19:52:29.3794853Z gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T19:52:29.3795001Z gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T19:52:29.3795102Z gen_embedding_backward_split_none.cpp 2025-05-07T19:52:29.3795284Z gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:29.3795400Z gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T19:52:29.3795553Z gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T19:52:29.3795764Z gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T19:52:29.3795986Z gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:29.3796180Z gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T19:52:29.3796349Z gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T19:52:29.3796472Z gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T19:52:29.3796621Z gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:29.3796775Z gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:29.3797024Z gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T19:52:29.3797212Z gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T19:52:29.3797344Z gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T19:52:29.3797493Z gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:29.3797625Z gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T19:52:29.3797769Z gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T19:52:29.3797904Z gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T19:52:29.3798047Z gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:29.3798284Z gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T19:52:29.3798444Z gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T19:52:29.3798651Z gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T19:52:29.3798858Z gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T19:52:29.3799056Z gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T19:52:29.3799263Z gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T19:52:29.3799398Z gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T19:52:29.3799541Z gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T19:52:29.3799771Z gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T19:52:29.3800006Z gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T19:52:29.3800078Z 2025-05-07T19:52:29.3800164Z HIPified Source Files: 2025-05-07T19:52:29.3800180Z 2025-05-07T19:52:29.3800249Z 2025-05-07T19:52:29.3800337Z Library Dependencies: 2025-05-07T19:52:29.3800407Z torch 2025-05-07T19:52:29.3800495Z torch_library 2025-05-07T19:52:29.3800802Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3800970Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3801304Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3801652Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3801835Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3801917Z fbgemm_gpu_config 2025-05-07T19:52:29.3802004Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:29.3802212Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3802281Z 2025-05-07T19:52:29.3802368Z Output Library: 2025-05-07T19:52:29.3802488Z fbgemm_gpu_tbe_training_backward_split_host 2025-05-07T19:52:29.3802557Z 2025-05-07T19:52:29.3802643Z Destination Directory: 2025-05-07T19:52:29.3802724Z fbgemm_gpu 2025-05-07T19:52:29.3802830Z ================================================================================ 2025-05-07T19:52:29.3802835Z 2025-05-07T19:52:29.3802933Z 2025-05-07T19:52:29.3802937Z 2025-05-07T19:52:29.3803047Z ================================================================================ 2025-05-07T19:52:29.3803222Z GPU CPP Library Target: fbgemm_gpu_tbe_index_select (SHARED) 2025-05-07T19:52:29.3803291Z 2025-05-07T19:52:29.3803366Z CPU_SRCS: 2025-05-07T19:52:29.3803580Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:52:29.3803765Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:52:29.3803835Z 2025-05-07T19:52:29.3803917Z GPU_SRCS: 2025-05-07T19:52:29.3804104Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:52:29.3804239Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:52:29.3804357Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:52:29.3804499Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:52:29.3804640Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:52:29.3804770Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:52:29.3804961Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:52:29.3805095Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:52:29.3805166Z 2025-05-07T19:52:29.3805248Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3805252Z 2025-05-07T19:52:29.3805437Z 2025-05-07T19:52:29.3805512Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3805516Z 2025-05-07T19:52:29.3805581Z 2025-05-07T19:52:29.3805657Z OTHER_SRCS: 2025-05-07T19:52:29.3805661Z 2025-05-07T19:52:29.3805727Z 2025-05-07T19:52:29.3805795Z CC_FLAGS: 2025-05-07T19:52:29.3805798Z 2025-05-07T19:52:29.3805869Z 2025-05-07T19:52:29.3805938Z NVCC_FLAGS: 2025-05-07T19:52:29.3806078Z --expt-relaxed-constexpr 2025-05-07T19:52:29.3806166Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:29.3806261Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:29.3806345Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:29.3806411Z 2025-05-07T19:52:29.3806486Z HIPCC_FLAGS: 2025-05-07T19:52:29.3806490Z 2025-05-07T19:52:29.3806554Z 2025-05-07T19:52:29.3806627Z INCLUDE_DIRS: 2025-05-07T19:52:29.3806724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3806815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3806907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3806999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3807264Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3807622Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3807748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3807896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3808045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3808230Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3808411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3808553Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3808833Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3808898Z 2025-05-07T19:52:29.3808991Z Selected Source Files: 2025-05-07T19:52:29.3809182Z codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T19:52:29.3809354Z codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T19:52:29.3809527Z codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T19:52:29.3809657Z gen_batch_index_select_dim0_forward_codegen_cuda.cu 2025-05-07T19:52:29.3809770Z gen_batch_index_select_dim0_forward_kernel.cu 2025-05-07T19:52:29.3809891Z gen_batch_index_select_dim0_forward_kernel_small.cu 2025-05-07T19:52:29.3810026Z gen_batch_index_select_dim0_backward_codegen_cuda.cu 2025-05-07T19:52:29.3810144Z gen_batch_index_select_dim0_backward_kernel_cta.cu 2025-05-07T19:52:29.3810266Z gen_batch_index_select_dim0_backward_kernel_warp.cu 2025-05-07T19:52:29.3810395Z gen_embedding_backward_split_grad_index_select.cu 2025-05-07T19:52:29.3810458Z 2025-05-07T19:52:29.3810538Z HIPified Source Files: 2025-05-07T19:52:29.3810543Z 2025-05-07T19:52:29.3810608Z 2025-05-07T19:52:29.3810699Z Library Dependencies: 2025-05-07T19:52:29.3810763Z torch 2025-05-07T19:52:29.3810831Z torch_library 2025-05-07T19:52:29.3811118Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3811267Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3811575Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3811905Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3812075Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3812164Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:52:29.3812241Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:29.3812493Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3812559Z 2025-05-07T19:52:29.3812636Z Output Library: 2025-05-07T19:52:29.3812729Z fbgemm_gpu_tbe_index_select 2025-05-07T19:52:29.3812792Z 2025-05-07T19:52:29.3812871Z Destination Directory: 2025-05-07T19:52:29.3812940Z fbgemm_gpu 2025-05-07T19:52:29.3813049Z ================================================================================ 2025-05-07T19:52:29.3813053Z 2025-05-07T19:52:29.3813057Z 2025-05-07T19:52:29.3813060Z 2025-05-07T19:52:29.3813158Z ================================================================================ 2025-05-07T19:52:29.3813385Z GPU CPP Library Target: fbgemm_gpu_embedding_inplace_ops (SHARED) 2025-05-07T19:52:29.3813456Z 2025-05-07T19:52:29.3813526Z CPU_SRCS: 2025-05-07T19:52:29.3813685Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:52:29.3813759Z 2025-05-07T19:52:29.3813828Z GPU_SRCS: 2025-05-07T19:52:29.3813986Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:52:29.3814125Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:52:29.3814197Z 2025-05-07T19:52:29.3814272Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3814276Z 2025-05-07T19:52:29.3814339Z 2025-05-07T19:52:29.3814422Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3814426Z 2025-05-07T19:52:29.3814490Z 2025-05-07T19:52:29.3814557Z OTHER_SRCS: 2025-05-07T19:52:29.3814561Z 2025-05-07T19:52:29.3814626Z 2025-05-07T19:52:29.3814702Z CC_FLAGS: 2025-05-07T19:52:29.3814705Z 2025-05-07T19:52:29.3814769Z 2025-05-07T19:52:29.3814841Z NVCC_FLAGS: 2025-05-07T19:52:29.3814935Z --expt-relaxed-constexpr 2025-05-07T19:52:29.3815024Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:29.3815114Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:29.3815198Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:29.3815269Z 2025-05-07T19:52:29.3815339Z HIPCC_FLAGS: 2025-05-07T19:52:29.3815343Z 2025-05-07T19:52:29.3815407Z 2025-05-07T19:52:29.3815486Z INCLUDE_DIRS: 2025-05-07T19:52:29.3815586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3815668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3815760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3815861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3816119Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3816474Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3816609Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3816755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3816898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3817089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3817272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3817405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3817692Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3817757Z 2025-05-07T19:52:29.3817836Z Selected Source Files: 2025-05-07T19:52:29.3817993Z src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T19:52:29.3818155Z src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T19:52:29.3818296Z src/embedding_inplace_ops/embedding_inplace_update.cu 2025-05-07T19:52:29.3818360Z 2025-05-07T19:52:29.3818448Z HIPified Source Files: 2025-05-07T19:52:29.3818451Z 2025-05-07T19:52:29.3818518Z 2025-05-07T19:52:29.3818603Z Library Dependencies: 2025-05-07T19:52:29.3818667Z torch 2025-05-07T19:52:29.3818744Z torch_library 2025-05-07T19:52:29.3819022Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3819168Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3819550Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3819871Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3820039Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3820240Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3820305Z 2025-05-07T19:52:29.3820379Z Output Library: 2025-05-07T19:52:29.3820473Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:52:29.3820545Z 2025-05-07T19:52:29.3820625Z Destination Directory: 2025-05-07T19:52:29.3820695Z fbgemm_gpu 2025-05-07T19:52:29.3820851Z ================================================================================ 2025-05-07T19:52:29.3820855Z 2025-05-07T19:52:29.3820858Z 2025-05-07T19:52:29.3820861Z 2025-05-07T19:52:29.3820962Z ================================================================================ 2025-05-07T19:52:29.3821079Z GPU CPP Library Target: fbgemm_gpu_py (SHARED) 2025-05-07T19:52:29.3821154Z 2025-05-07T19:52:29.3821224Z CPU_SRCS: 2025-05-07T19:52:29.3821315Z src/memory_utils/memory_utils.cpp 2025-05-07T19:52:29.3821410Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:52:29.3821602Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:52:29.3821795Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:52:29.3821980Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:52:29.3822188Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:52:29.3822384Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:52:29.3822603Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:52:29.3822751Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:52:29.3822870Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:52:29.3822985Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:52:29.3823095Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:52:29.3823237Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:52:29.3823332Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:52:29.3823426Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:52:29.3823550Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:52:29.3823643Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:52:29.3823733Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:52:29.3823816Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:52:29.3823905Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:52:29.3824000Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:52:29.3824089Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:52:29.3824185Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:52:29.3824274Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:52:29.3824492Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:52:29.3824639Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:52:29.3824835Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:52:29.3825049Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:52:29.3825142Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:52:29.3825237Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:52:29.3825325Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:52:29.3825431Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:52:29.3825615Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:52:29.3825695Z src/topology_utils.cpp 2025-05-07T19:52:29.3825765Z 2025-05-07T19:52:29.3825836Z GPU_SRCS: 2025-05-07T19:52:29.3825947Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:52:29.3826042Z src/input_combine_ops/input_combine.cu 2025-05-07T19:52:29.3826236Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:52:29.3826331Z src/memory_utils/memory_utils.cu 2025-05-07T19:52:29.3826471Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:52:29.3826646Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:52:29.3826813Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:52:29.3826939Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:52:29.3827058Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:52:29.3827291Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:52:29.3827463Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:52:29.3827672Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:52:29.3827803Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:52:29.3827945Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:52:29.3828069Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:52:29.3828185Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:52:29.3828301Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:52:29.3828416Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:52:29.3828563Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:52:29.3828703Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:52:29.3828822Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:52:29.3828960Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:52:29.3829079Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:52:29.3829171Z src/metric_ops/metric_ops.cu 2025-05-07T19:52:29.3829377Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:52:29.3829555Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:52:29.3829723Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:52:29.3829826Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:52:29.3829930Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:52:29.3830045Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:52:29.3830163Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:52:29.3830255Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:52:29.3830341Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:52:29.3830456Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:52:29.3830552Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:52:29.3830664Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:52:29.3830787Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:52:29.3830898Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:52:29.3831023Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:52:29.3831154Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:52:29.3831448Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:52:29.3831724Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:52:29.3832002Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:52:29.3832107Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:52:29.3832219Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:52:29.3832344Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:52:29.3832469Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:52:29.3832645Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:52:29.3832752Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:52:29.3832852Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:52:29.3832966Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:52:29.3833068Z src/sparse_ops/sparse_range.cu 2025-05-07T19:52:29.3833189Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:52:29.3833297Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:52:29.3833400Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:52:29.3833470Z 2025-05-07T19:52:29.3833552Z CUDA_SPECIFIC_SRCS: 2025-05-07T19:52:29.3833557Z 2025-05-07T19:52:29.3833635Z 2025-05-07T19:52:29.3833718Z HIP_SPECIFIC_SRCS: 2025-05-07T19:52:29.3833837Z 2025-05-07T19:52:29.3833908Z 2025-05-07T19:52:29.3833982Z OTHER_SRCS: 2025-05-07T19:52:29.3833987Z 2025-05-07T19:52:29.3834064Z 2025-05-07T19:52:29.3834140Z CC_FLAGS: 2025-05-07T19:52:29.3834144Z 2025-05-07T19:52:29.3834213Z 2025-05-07T19:52:29.3834297Z NVCC_FLAGS: 2025-05-07T19:52:29.3834389Z --expt-relaxed-constexpr 2025-05-07T19:52:29.3834482Z -D__CUDA_NO_HALF_OPERATORS__ 2025-05-07T19:52:29.3834579Z -D__CUDA_NO_BFLOAT16_CONVERSIONS__ 2025-05-07T19:52:29.3834674Z -D__CUDA_NO_HALF2_OPERATORS__ 2025-05-07T19:52:29.3834742Z 2025-05-07T19:52:29.3834817Z HIPCC_FLAGS: 2025-05-07T19:52:29.3834821Z 2025-05-07T19:52:29.3834964Z 2025-05-07T19:52:29.3835041Z INCLUDE_DIRS: 2025-05-07T19:52:29.3835142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3835232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu 2025-05-07T19:52:29.3835336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include 2025-05-07T19:52:29.3835436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../include 2025-05-07T19:52:29.3835722Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include 2025-05-07T19:52:29.3836119Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include 2025-05-07T19:52:29.3836259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src 2025-05-07T19:52:29.3836415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include 2025-05-07T19:52:29.3836577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include 2025-05-07T19:52:29.3836776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include 2025-05-07T19:52:29.3836973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include 2025-05-07T19:52:29.3837118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include 2025-05-07T19:52:29.3837433Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include 2025-05-07T19:52:29.3837506Z 2025-05-07T19:52:29.3837590Z Selected Source Files: 2025-05-07T19:52:29.3837693Z src/memory_utils/memory_utils.cpp 2025-05-07T19:52:29.3837802Z src/memory_utils/memory_utils_ops.cpp 2025-05-07T19:52:29.3837999Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T19:52:29.3838206Z src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T19:52:29.3838421Z src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T19:52:29.3838634Z src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T19:52:29.3838844Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T19:52:29.3839083Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T19:52:29.3839233Z src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T19:52:29.3839360Z src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T19:52:29.3839492Z src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T19:52:29.3839608Z src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T19:52:29.3839754Z src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T19:52:29.3839859Z src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T19:52:29.3839970Z src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T19:52:29.3840096Z src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T19:52:29.3840193Z src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T19:52:29.3840296Z src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T19:52:29.3840387Z src/tbe/eeg/eeg_models.cpp 2025-05-07T19:52:29.3840473Z src/tbe/eeg/eeg_utils.cpp 2025-05-07T19:52:29.3840577Z src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T19:52:29.3840675Z src/tbe/eeg/indices_estimator.cpp 2025-05-07T19:52:29.3840777Z src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T19:52:29.3840875Z src/tbe/eeg/indices_generator.cpp 2025-05-07T19:52:29.3841117Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T19:52:29.3841264Z src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T19:52:29.3841477Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T19:52:29.3841773Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T19:52:29.3841876Z src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T19:52:29.3841972Z src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T19:52:29.3842071Z src/metric_ops/metric_ops_host.cpp 2025-05-07T19:52:29.3842200Z src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T19:52:29.3842395Z src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T19:52:29.3842480Z src/topology_utils.cpp 2025-05-07T19:52:29.3842598Z src/histogram_binning_calibration_ops.cu 2025-05-07T19:52:29.3842697Z src/input_combine_ops/input_combine.cu 2025-05-07T19:52:29.3842988Z src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu 2025-05-07T19:52:29.3843083Z src/memory_utils/memory_utils.cu 2025-05-07T19:52:29.3843181Z src/memory_utils/memory_utils_ops.cu 2025-05-07T19:52:29.3843375Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu 2025-05-07T19:52:29.3843560Z src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu 2025-05-07T19:52:29.3843689Z src/jagged_tensor_ops/dense_to_jagged_forward.cu 2025-05-07T19:52:29.3843828Z src/jagged_tensor_ops/jagged_dense_bmm_forward.cu 2025-05-07T19:52:29.3844078Z src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu 2025-05-07T19:52:29.3844255Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu 2025-05-07T19:52:29.3844428Z src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu 2025-05-07T19:52:29.3844576Z src/jagged_tensor_ops/jagged_index_add_2d_forward.cu 2025-05-07T19:52:29.3844726Z src/jagged_tensor_ops/jagged_index_select_2d_forward.cu 2025-05-07T19:52:29.3844973Z src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu 2025-05-07T19:52:29.3845099Z src/jagged_tensor_ops/jagged_softmax_backward.cu 2025-05-07T19:52:29.3845216Z src/jagged_tensor_ops/jagged_softmax_forward.cu 2025-05-07T19:52:29.3845318Z src/jagged_tensor_ops/jagged_tensor_ops.cu 2025-05-07T19:52:29.3845473Z src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu 2025-05-07T19:52:29.3845614Z src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu 2025-05-07T19:52:29.3845729Z src/jagged_tensor_ops/jagged_unique_indices.cu 2025-05-07T19:52:29.3845866Z src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu 2025-05-07T19:52:29.3845990Z src/layout_transform_ops/layout_transform_ops.cu 2025-05-07T19:52:29.3846078Z src/metric_ops/metric_ops.cu 2025-05-07T19:52:29.3846279Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu 2025-05-07T19:52:29.3846459Z src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu 2025-05-07T19:52:29.3846627Z src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu 2025-05-07T19:52:29.3846726Z src/quantize_ops/quantize_bfloat16.cu 2025-05-07T19:52:29.3846825Z src/quantize_ops/quantize_fp8_rowwise.cu 2025-05-07T19:52:29.3846950Z src/quantize_ops/quantize_fused_8bit_rowwise.cu 2025-05-07T19:52:29.3847065Z src/quantize_ops/quantize_fused_nbit_rowwise.cu 2025-05-07T19:52:29.3847158Z src/quantize_ops/quantize_hfp8.cu 2025-05-07T19:52:29.3847253Z src/quantize_ops/quantize_msfp.cu 2025-05-07T19:52:29.3847370Z src/quantize_ops/quantize_padded_fp8_rowwise.cu 2025-05-07T19:52:29.3847459Z src/quantize_ops/quantize_mx.cu 2025-05-07T19:52:29.3847576Z src/sparse_ops/sparse_async_batched_cumsum.cu 2025-05-07T19:52:29.3847699Z src/sparse_ops/sparse_block_bucketize_features.cu 2025-05-07T19:52:29.3847813Z src/sparse_ops/sparse_bucketize_features.cu 2025-05-07T19:52:29.3847935Z src/sparse_ops/sparse_batched_unary_embeddings.cu 2025-05-07T19:52:29.3848071Z src/sparse_ops/sparse_compute_frequency_sequence.cu 2025-05-07T19:52:29.3848200Z src/sparse_ops/sparse_expand_into_jagged_permute.cu 2025-05-07T19:52:29.3848295Z src/sparse_ops/sparse_group_index.cu 2025-05-07T19:52:29.3848394Z src/sparse_ops/sparse_index_add.cu 2025-05-07T19:52:29.3848486Z src/sparse_ops/sparse_index_select.cu 2025-05-07T19:52:29.3848585Z src/sparse_ops/sparse_invert_permute.cu 2025-05-07T19:52:29.3848705Z src/sparse_ops/sparse_pack_segments_backward.cu 2025-05-07T19:52:29.3848878Z src/sparse_ops/sparse_pack_segments_forward.cu 2025-05-07T19:52:29.3848976Z src/sparse_ops/sparse_permute_1d.cu 2025-05-07T19:52:29.3849069Z src/sparse_ops/sparse_permute_2d.cu 2025-05-07T19:52:29.3849168Z src/sparse_ops/sparse_permute102.cu 2025-05-07T19:52:29.3849275Z src/sparse_ops/sparse_permute_embeddings.cu 2025-05-07T19:52:29.3849363Z src/sparse_ops/sparse_range.cu 2025-05-07T19:52:29.3849480Z src/sparse_ops/sparse_reorder_batched_ad.cu 2025-05-07T19:52:29.3849582Z src/sparse_ops/sparse_segment_sum_csr.cu 2025-05-07T19:52:29.3849675Z src/sparse_ops/sparse_zipf.cu 2025-05-07T19:52:29.3849747Z 2025-05-07T19:52:29.3849890Z HIPified Source Files: 2025-05-07T19:52:29.3849894Z 2025-05-07T19:52:29.3849962Z 2025-05-07T19:52:29.3850047Z Library Dependencies: 2025-05-07T19:52:29.3850119Z torch 2025-05-07T19:52:29.3850190Z torch_library 2025-05-07T19:52:29.3850479Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so 2025-05-07T19:52:29.3850634Z /github/home/miniconda/envs/build_binary/lib/libnvrtc.so 2025-05-07T19:52:29.3850945Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so 2025-05-07T19:52:29.3851264Z /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 2025-05-07T19:52:29.3851438Z /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so 2025-05-07T19:52:29.3851512Z fbgemm 2025-05-07T19:52:29.3851606Z fbgemm_gpu_sparse_async_cumsum 2025-05-07T19:52:29.3851695Z fbgemm_gpu_embedding_inplace_ops 2025-05-07T19:52:29.3851788Z fbgemm_gpu_tbe_index_select 2025-05-07T19:52:29.3851875Z fbgemm_gpu_tbe_cache 2025-05-07T19:52:29.3851960Z fbgemm_gpu_tbe_optimizers 2025-05-07T19:52:29.3852036Z fbgemm_gpu_tbe_utils 2025-05-07T19:52:29.3852238Z /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so 2025-05-07T19:52:29.3852304Z 2025-05-07T19:52:29.3852378Z Output Library: 2025-05-07T19:52:29.3852448Z fbgemm_gpu_py 2025-05-07T19:52:29.3852520Z 2025-05-07T19:52:29.3852606Z Destination Directory: 2025-05-07T19:52:29.3852675Z fbgemm_gpu 2025-05-07T19:52:29.3852782Z ================================================================================ 2025-05-07T19:52:29.3852787Z 2025-05-07T19:52:29.3852871Z -- Configuring done (8.8s) 2025-05-07T19:52:29.4973825Z -- Generating done (0.1s) 2025-05-07T19:52:29.4991587Z -- Build files have been written to: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build 2025-05-07T19:52:29.5159423Z Change Dir: '/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build' 2025-05-07T19:52:29.5159444Z 2025-05-07T19:52:29.5159765Z Run Build Command(s): /github/home/miniconda/envs/build_binary/bin/ninja -v -j 48 install 2025-05-07T19:52:29.6214622Z [1/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp 2025-05-07T19:52:29.6215602Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.6410784Z [2/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp 2025-05-07T19:52:29.6411768Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.6454270Z [3/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp 2025-05-07T19:52:29.6455027Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.6507423Z [4/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp 2025-05-07T19:52:29.6508173Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.6655770Z [5/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp 2025-05-07T19:52:29.6657442Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.6666649Z [6/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp 2025-05-07T19:52:29.6668045Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.6722454Z [7/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp 2025-05-07T19:52:29.6723930Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.6763244Z [8/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp 2025-05-07T19:52:29.6764866Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.6879858Z [9/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp 2025-05-07T19:52:29.6881250Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.7057612Z [10/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp 2025-05-07T19:52:29.7068522Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.7195525Z [11/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp 2025-05-07T19:52:29.7206647Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.7223434Z [12/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp 2025-05-07T19:52:29.7234536Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.7338888Z [13/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp 2025-05-07T19:52:29.7349645Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.7420134Z [14/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp 2025-05-07T19:52:29.7431640Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.7494957Z [15/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp 2025-05-07T19:52:29.7502990Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.7509002Z [16/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp 2025-05-07T19:52:29.7515025Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.7550964Z [17/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp 2025-05-07T19:52:29.7557212Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.7602535Z [18/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp 2025-05-07T19:52:29.7613493Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.7703565Z [19/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp 2025-05-07T19:52:29.7714326Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.7758394Z [20/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp 2025-05-07T19:52:29.7768982Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.7907580Z [21/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp 2025-05-07T19:52:29.7918181Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.8025519Z [22/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp 2025-05-07T19:52:29.8031844Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.8365201Z [23/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp 2025-05-07T19:52:29.8376438Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.8386773Z [24/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp 2025-05-07T19:52:29.8397835Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.8431664Z [25/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp 2025-05-07T19:52:29.8442150Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.8464252Z [26/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp 2025-05-07T19:52:29.8475227Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.8485523Z [27/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp 2025-05-07T19:52:29.8496186Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.8667572Z [28/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp 2025-05-07T19:52:29.8678866Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.8691155Z [29/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp 2025-05-07T19:52:29.8702106Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.8826667Z [30/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp 2025-05-07T19:52:29.8838347Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.8848992Z [31/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp 2025-05-07T19:52:29.8860088Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.8889660Z [32/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp 2025-05-07T19:52:29.8895940Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.8987643Z [33/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp 2025-05-07T19:52:29.8999494Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.9068966Z [34/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp 2025-05-07T19:52:29.9090291Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.9100765Z [35/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp 2025-05-07T19:52:29.9111421Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.9219227Z [36/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp 2025-05-07T19:52:29.9229979Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.9310848Z [37/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp 2025-05-07T19:52:29.9322592Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.9544585Z [38/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp 2025-05-07T19:52:29.9556599Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.9705511Z [39/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp 2025-05-07T19:52:29.9717553Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.9841022Z [40/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp 2025-05-07T19:52:29.9852107Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:29.9975741Z [41/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp 2025-05-07T19:52:29.9987194Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.0234813Z [42/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp 2025-05-07T19:52:30.0245575Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.0370295Z [43/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp 2025-05-07T19:52:30.0380925Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.0530288Z [44/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp 2025-05-07T19:52:30.0541425Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.0722312Z [45/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp 2025-05-07T19:52:30.0733409Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.1496715Z [46/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp 2025-05-07T19:52:30.1508114Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.1857853Z [47/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp 2025-05-07T19:52:30.1869395Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.2108611Z [48/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp 2025-05-07T19:52:30.2120272Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.2421047Z [49/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp 2025-05-07T19:52:30.2432369Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.2538665Z [50/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp 2025-05-07T19:52:30.2550229Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.3432101Z [51/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp 2025-05-07T19:52:30.3443084Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.3454202Z [52/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp 2025-05-07T19:52:30.3465487Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.3943021Z [53/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp 2025-05-07T19:52:30.3954592Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.4591437Z [54/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp 2025-05-07T19:52:30.4602542Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.4750295Z [55/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc 2025-05-07T19:52:30.4759829Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.6150512Z [56/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -mavx512f -mavx512bw -mavx512dq -mavx512vl -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc 2025-05-07T19:52:30.6160340Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.6480491Z [57/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp 2025-05-07T19:52:30.6491875Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.6650201Z [58/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp 2025-05-07T19:52:30.6661268Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.8082779Z [59/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtils.cc 2025-05-07T19:52:30.8100202Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:30.9521649Z [60/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp 2025-05-07T19:52:30.9534076Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:31.0544321Z [61/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp 2025-05-07T19:52:31.0550525Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:31.2543093Z [62/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dasmjit_EXPORTS -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -MF CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o.d -o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o -c /__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp 2025-05-07T19:52:31.2555081Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:31.8859205Z [63/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o -c /__w/FBGEMM/FBGEMM/src/Utils.cc 2025-05-07T19:52:31.8876033Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:31.9503470Z [64/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,asmjit.so -o asmjit.so CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/a64rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/arm/armformatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/archtraits.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codeholder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/codewriter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/constpool.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/cpuinfo.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/emitterutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/environment.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/errorhandler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/funcargscontext.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/globals.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/inst.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitallocator.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/jitruntime.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/logger.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/osutils.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/ralocal.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rapass.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/rastack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/string.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/support.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/target.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/type.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/virtmem.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zone.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonehash.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonelist.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonestack.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonetree.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/core/zonevector.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86assembler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86builder.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86compiler.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86emithelper.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86formatter.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86func.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instapi.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86instdb.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86operand.cpp.o CMakeFiles/asmjit.dir/__w/FBGEMM/FBGEMM/external/asmjit/src/asmjit/x86/x86rapass.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:52:32.2275362Z [65/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o -c /__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc 2025-05-07T19:52:32.2291960Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:35.6706767Z [66/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o -c /__w/FBGEMM/FBGEMM/src/RefImplementations.cc 2025-05-07T19:52:35.6723574Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:35.8442861Z [67/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o -c /__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc 2025-05-07T19:52:35.8461337Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:37.3169658Z [68/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cpp 2025-05-07T19:52:37.3187054Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:37.3395486Z [69/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cpp 2025-05-07T19:52:37.3413912Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:37.3806312Z [70/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cpp 2025-05-07T19:52:37.3822452Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:37.4232204Z [71/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cpp 2025-05-07T19:52:37.4249469Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:37.9115589Z [72/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc 2025-05-07T19:52:37.9133652Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:38.2403536Z [73/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cpp 2025-05-07T19:52:38.2421660Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:38.5357914Z [74/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host.cpp 2025-05-07T19:52:38.5375519Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:39.4702439Z [75/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp 2025-05-07T19:52:39.4720238Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:40.1114627Z [76/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_config_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -MF CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o.d -o CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/config/feature_gates.cpp 2025-05-07T19:52:40.1132042Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:40.7317991Z [77/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_config.so -o fbgemm_gpu_config.so CMakeFiles/fbgemm_gpu_config.dir/src/config/feature_gates.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed && : 2025-05-07T19:52:41.0523819Z [78/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o -c /__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc 2025-05-07T19:52:41.5234458Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:41.5251933Z [79/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp 2025-05-07T19:52:41.5269501Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:44.0029642Z [80/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_meta.cpp 2025-05-07T19:52:44.0047098Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:44.3161079Z [81/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp 2025-05-07T19:52:44.3174660Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:45.4127773Z [82/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/split_embeddings_utils.cpp 2025-05-07T19:52:45.4145081Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:47.9296236Z [83/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cpp 2025-05-07T19:52:47.9314606Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:48.0099818Z [84/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_host_cpu.cpp 2025-05-07T19:52:48.0117957Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:48.3555637Z [85/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/pt2/pt2_autograd_utils.cpp 2025-05-07T19:52:48.3573756Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:51.0449666Z [86/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host.cpp 2025-05-07T19:52:51.0467678Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:51.4152701Z [87/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_host_cpu.cpp 2025-05-07T19:52:51.4170531Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:55.2282149Z [88/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc 2025-05-07T19:52:55.2297828Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:56.8197410Z [89/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split.cpp 2025-05-07T19:52:56.8213589Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:57.1759924Z [90/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/forward/embedding_forward_split_cpu.cpp 2025-05-07T19:52:57.1774497Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:59.4691934Z [91/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_codegen_meta.cpp 2025-05-07T19:52:59.4711865Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:52:59.7830870Z [92/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_codegen_meta.cpp 2025-05-07T19:52:59.7849616Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:02.0999157Z [93/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_pt2_cpu_wrapper.cpp 2025-05-07T19:53:02.1017205Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:03.5276121Z [94/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_codegen_meta.cpp 2025-05-07T19:53:03.5294381Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:05.0405793Z [95/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_codegen_meta.cpp 2025-05-07T19:53:05.0423626Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:05.3957667Z [96/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_codegen_meta.cpp 2025-05-07T19:53:05.3970701Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:07.9014121Z [97/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp 2025-05-07T19:53:07.9032831Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:08.0354185Z [98/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp 2025-05-07T19:53:08.0371885Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:08.3686315Z [99/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/inference/embedding_forward_quantized_split_lookup.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o 2025-05-07T19:53:08.3707629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.3709535Z 2025-05-07T19:53:08.3711599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.3713689Z 2025-05-07T19:53:08.3715331Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.3717095Z 2025-05-07T19:53:08.3718628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.3720812Z 2025-05-07T19:53:08.3722248Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.3723984Z 2025-05-07T19:53:08.3725545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.3727413Z 2025-05-07T19:53:08.8312969Z [100/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o 2025-05-07T19:53:08.8333836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.8336022Z 2025-05-07T19:53:08.8337694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.8339566Z 2025-05-07T19:53:08.8341138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.8343027Z 2025-05-07T19:53:08.8344827Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.8346628Z 2025-05-07T19:53:08.8348270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.8350008Z 2025-05-07T19:53:08.8351613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.8353562Z 2025-05-07T19:53:08.9290813Z [101/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o 2025-05-07T19:53:08.9313645Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9315564Z 2025-05-07T19:53:08.9317226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9319180Z 2025-05-07T19:53:08.9320479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu(13): warning #177-D: variable "::TORCH_LIBRARY_FRAGMENT_static_init_fbgemm_2" was declared but never referenced 2025-05-07T19:53:08.9321885Z 2025-05-07T19:53:08.9323082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9324433Z 2025-05-07T19:53:08.9325665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9327042Z 2025-05-07T19:53:08.9328108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/split_embeddings_cache_ops.cu(13): warning #177-D: variable "::TORCH_LIBRARY_FRAGMENT_static_init_fbgemm_2" was declared but never referenced 2025-05-07T19:53:08.9329310Z 2025-05-07T19:53:08.9330513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9332184Z 2025-05-07T19:53:08.9333408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9334785Z 2025-05-07T19:53:08.9464350Z [102/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/reset_weight_momentum.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o 2025-05-07T19:53:08.9487464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9489668Z 2025-05-07T19:53:08.9491423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9493420Z 2025-05-07T19:53:08.9495162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9497125Z 2025-05-07T19:53:08.9498865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9500841Z 2025-05-07T19:53:08.9502583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9504547Z 2025-05-07T19:53:08.9506311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:08.9508287Z 2025-05-07T19:53:09.2302576Z [103/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o 2025-05-07T19:53:09.2322811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.2324798Z 2025-05-07T19:53:09.2326243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.2327630Z 2025-05-07T19:53:09.2328962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.2330403Z 2025-05-07T19:53:09.2332175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.2333797Z 2025-05-07T19:53:09.2335283Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.2336936Z 2025-05-07T19:53:09.2338497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.2340375Z 2025-05-07T19:53:09.4732791Z [104/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lxu_cache.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o 2025-05-07T19:53:09.4755488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.4757672Z 2025-05-07T19:53:09.4759379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.4761341Z 2025-05-07T19:53:09.4763106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.4765032Z 2025-05-07T19:53:09.4766775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.4768602Z 2025-05-07T19:53:09.4770315Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.4772244Z 2025-05-07T19:53:09.4773951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.4775843Z 2025-05-07T19:53:09.5240981Z [105/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o 2025-05-07T19:53:09.5263166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.5265378Z 2025-05-07T19:53:09.5267026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.5268932Z 2025-05-07T19:53:09.5270642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.5272529Z 2025-05-07T19:53:09.5274369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.5276260Z 2025-05-07T19:53:09.5277928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.5279845Z 2025-05-07T19:53:09.5281615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.5283570Z 2025-05-07T19:53:09.9857273Z [106/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_populate_byte.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o 2025-05-07T19:53:09.9880133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.9882127Z 2025-05-07T19:53:09.9883852Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.9885796Z 2025-05-07T19:53:09.9887479Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.9889387Z 2025-05-07T19:53:09.9891077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.9893010Z 2025-05-07T19:53:09.9894704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.9896287Z 2025-05-07T19:53:09.9897918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:09.9899771Z 2025-05-07T19:53:10.3895659Z [107/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp 2025-05-07T19:53:10.3915435Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:12.0374733Z [108/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:53:12.0395125Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:12.2212529Z [109/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/generate_vbe_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o 2025-05-07T19:53:13.3146684Z [110/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp 2025-05-07T19:53:13.3165178Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:14.1466170Z [111/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/get_infos_metadata.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o 2025-05-07T19:53:14.1486280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.1488104Z 2025-05-07T19:53:14.1489701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.1491426Z 2025-05-07T19:53:14.1492942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.1494713Z 2025-05-07T19:53:14.1496257Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.1497969Z 2025-05-07T19:53:14.1499476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.1501337Z 2025-05-07T19:53:14.1502949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:14.1504696Z 2025-05-07T19:53:19.2781115Z [112/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lfu_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o 2025-05-07T19:53:19.2800990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2802761Z 2025-05-07T19:53:19.2804242Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2805909Z 2025-05-07T19:53:19.2807364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2809035Z 2025-05-07T19:53:19.2810525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2812327Z 2025-05-07T19:53:19.2813752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2815345Z 2025-05-07T19:53:19.2816773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:19.2818398Z 2025-05-07T19:53:20.8323796Z [113/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v1.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o 2025-05-07T19:53:20.8346979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.8349025Z 2025-05-07T19:53:20.8350814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.8352917Z 2025-05-07T19:53:20.8354691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.8356648Z 2025-05-07T19:53:20.8358419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.8360421Z 2025-05-07T19:53:20.8362175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.8364177Z 2025-05-07T19:53:20.8365936Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:20.8367928Z 2025-05-07T19:53:21.3269547Z [114/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_pt2_cuda_wrapper.cpp 2025-05-07T19:53:21.3280047Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:21.5714662Z [115/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_common_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/utils/embedding_bounds_check_v2.cu -o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o 2025-05-07T19:53:21.5737773Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.5739797Z 2025-05-07T19:53:21.5741574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.5743545Z 2025-05-07T19:53:21.5745239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.5747435Z 2025-05-07T19:53:21.5749162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.5751127Z 2025-05-07T19:53:21.5752948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.5754874Z 2025-05-07T19:53:21.5756617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:21.5758604Z 2025-05-07T19:53:22.2045142Z [116/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp 2025-05-07T19:53:22.2063932Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:23.4753869Z [117/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/lru_cache_find.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o 2025-05-07T19:53:23.4775091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.4776959Z 2025-05-07T19:53:23.4778340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.4780183Z 2025-05-07T19:53:23.4781656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.4783427Z 2025-05-07T19:53:23.4785103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.4786793Z 2025-05-07T19:53:23.4788488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.4790176Z 2025-05-07T19:53:23.4791658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:23.4793543Z 2025-05-07T19:53:26.3984169Z [118/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o 2025-05-07T19:53:26.4006277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.4008086Z 2025-05-07T19:53:26.4009683Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.4011263Z 2025-05-07T19:53:26.4012610Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.4014101Z 2025-05-07T19:53:26.4015636Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.4017334Z 2025-05-07T19:53:26.4018820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.4020429Z 2025-05-07T19:53:26.4021830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:26.4023639Z 2025-05-07T19:53:27.7199431Z [119/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o 2025-05-07T19:53:27.7216969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.7218417Z 2025-05-07T19:53:27.7219662Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.7221075Z 2025-05-07T19:53:27.7222289Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.7223652Z 2025-05-07T19:53:27.7224895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.7226283Z 2025-05-07T19:53:27.7227788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.7229188Z 2025-05-07T19:53:27.7230698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:27.7232366Z 2025-05-07T19:53:28.7824984Z [120/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o 2025-05-07T19:53:28.7847264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.7849087Z 2025-05-07T19:53:28.7850680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.7852435Z 2025-05-07T19:53:28.7854082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.7855844Z 2025-05-07T19:53:28.7857405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.7859243Z 2025-05-07T19:53:28.7861279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.7863161Z 2025-05-07T19:53:28.7864726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.7866512Z 2025-05-07T19:53:28.9155069Z [121/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o 2025-05-07T19:53:28.9177799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.9179680Z 2025-05-07T19:53:28.9181319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.9183179Z 2025-05-07T19:53:28.9184861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.9186732Z 2025-05-07T19:53:28.9188630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.9190253Z 2025-05-07T19:53:28.9191717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.9193678Z 2025-05-07T19:53:28.9195347Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:28.9197344Z 2025-05-07T19:53:30.5100802Z [122/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_sparse_async_cumsum_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_cumsum.cu -o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o 2025-05-07T19:53:30.5121228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.5123089Z 2025-05-07T19:53:30.5124694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.5126535Z 2025-05-07T19:53:30.5128109Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.5129979Z 2025-05-07T19:53:30.5131914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.5134071Z 2025-05-07T19:53:30.5135694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.5137481Z 2025-05-07T19:53:30.5139089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:30.5140942Z 2025-05-07T19:53:39.0022080Z [123/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_cache_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_cache/linearize_cache_indices.cu -o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o 2025-05-07T19:53:39.0034039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.0035028Z 2025-05-07T19:53:39.0035895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.0036869Z 2025-05-07T19:53:39.0037744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.0038720Z 2025-05-07T19:53:39.0039842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.0040854Z 2025-05-07T19:53:39.0041714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.0042696Z 2025-05-07T19:53:39.0043560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:39.0044656Z 2025-05-07T19:53:39.6926011Z [124/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_cache.so -o fbgemm_gpu_tbe_cache.so CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lfu_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/linearize_cache_indices.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_find.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lru_cache_populate_byte.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/lxu_cache.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/reset_weight_momentum.cu.o CMakeFiles/fbgemm_gpu_tbe_cache.dir/src/split_embeddings_cache/split_embeddings_cache_ops.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:46.6599447Z [125/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_optimizers_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o 2025-05-07T19:53:46.6623571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.6625734Z 2025-05-07T19:53:46.6627536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.6629606Z 2025-05-07T19:53:46.6631681Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.6633824Z 2025-05-07T19:53:46.6635427Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.6637451Z 2025-05-07T19:53:46.6639199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.6641671Z 2025-05-07T19:53:46.6643394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.6645333Z 2025-05-07T19:53:46.9111682Z [126/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/transpose_embedding_input.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o 2025-05-07T19:53:46.9135161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9137339Z 2025-05-07T19:53:46.9139149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9141131Z 2025-05-07T19:53:46.9142887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9144890Z 2025-05-07T19:53:46.9146687Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9148703Z 2025-05-07T19:53:46.9150354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9152205Z 2025-05-07T19:53:46.9154498Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:46.9156475Z 2025-05-07T19:53:47.2913507Z [127/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_optimizers.so -o fbgemm_gpu_tbe_optimizers.so CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split.cpp.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_optimizers.dir/gen_embedding_optimizer_rowwise_adagrad_split_kernel.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:53:49.7171095Z [128/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -MF CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o.d -o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o -c /__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc 2025-05-07T19:53:49.7188051Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:53:50.0133712Z [129/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o 2025-05-07T19:53:50.0157415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:50.0159293Z 2025-05-07T19:53:50.0161098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:50.0163034Z 2025-05-07T19:53:50.0165199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:50.0167175Z 2025-05-07T19:53:50.0168916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:50.0170885Z 2025-05-07T19:53:50.0172578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:50.0174852Z 2025-05-07T19:53:50.0176519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:50.0178534Z 2025-05-07T19:53:50.3848792Z [130/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o 2025-05-07T19:53:50.3871453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:50.3873415Z 2025-05-07T19:53:50.3875511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:50.3877291Z 2025-05-07T19:53:50.3878790Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:50.3880509Z 2025-05-07T19:53:50.3882097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:50.3883964Z 2025-05-07T19:53:50.3885914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:50.3887789Z 2025-05-07T19:53:50.3889529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:50.3891448Z 2025-05-07T19:53:50.3893172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:50.3895132Z 2025-05-07T19:53:50.3896804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:50.3898568Z 2025-05-07T19:53:50.3900014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:50.3901762Z 2025-05-07T19:53:50.3903335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:50.3905160Z 2025-05-07T19:53:50.3906464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:50.3908254Z 2025-05-07T19:53:50.3909577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:50.3911256Z 2025-05-07T19:53:50.3912906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:50.3914596Z 2025-05-07T19:53:50.3916043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:50.3917731Z 2025-05-07T19:53:50.3919163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:50.3920807Z 2025-05-07T19:53:50.4713818Z [131/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm.so -o fbgemm.so CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDM.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAutovec.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMNBit.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RefImplementations.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/RowWiseSparseAdagradFused.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/SparseAdagrad.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/Utils.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/QuantUtilsAvx2.cc.o CMakeFiles/fbgemm.dir/__w/FBGEMM/FBGEMM/src/EmbeddingSpMDMAvx512.cc.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so asmjit.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so && : 2025-05-07T19:53:51.1567373Z [132/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_common.so -o fbgemm_gpu_tbe_common.so CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/forward/embedding_forward_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/training/pt2/pt2_autograd_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v1.cu.o CMakeFiles/fbgemm_gpu_tbe_common.dir/codegen/utils/embedding_bounds_check_v2.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T19:53:53.7327995Z [133/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:53.7351748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.7353828Z 2025-05-07T19:53:53.7356772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.7358699Z 2025-05-07T19:53:53.7360286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.7362072Z 2025-05-07T19:53:53.7363591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.7365606Z 2025-05-07T19:53:53.7367176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.7369003Z 2025-05-07T19:53:53.7370629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.7372447Z 2025-05-07T19:53:53.7374224Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.7376133Z 2025-05-07T19:53:53.7377804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.7379646Z 2025-05-07T19:53:53.7381228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.7382865Z 2025-05-07T19:53:53.7384397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.7386219Z 2025-05-07T19:53:53.7387748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.7389119Z 2025-05-07T19:53:53.7390729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.7392340Z 2025-05-07T19:53:53.7394103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.7395911Z 2025-05-07T19:53:53.7397535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.7399368Z 2025-05-07T19:53:53.7400895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.7402727Z 2025-05-07T19:53:53.7404363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.7406101Z 2025-05-07T19:53:53.7407537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.7409134Z 2025-05-07T19:53:53.7410623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.7412240Z 2025-05-07T19:53:53.8119940Z [134/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o 2025-05-07T19:53:53.8145236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8146995Z 2025-05-07T19:53:53.8148536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8150380Z 2025-05-07T19:53:53.8151738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.8153557Z 2025-05-07T19:53:53.8155442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.8157054Z 2025-05-07T19:53:53.8158416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.8159970Z 2025-05-07T19:53:53.8161363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.8163305Z 2025-05-07T19:53:53.8164873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8166644Z 2025-05-07T19:53:53.8168279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8169874Z 2025-05-07T19:53:53.8171262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.8172890Z 2025-05-07T19:53:53.8174392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.8176105Z 2025-05-07T19:53:53.8177661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.8179234Z 2025-05-07T19:53:53.8180665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.8182332Z 2025-05-07T19:53:53.8183989Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8185854Z 2025-05-07T19:53:53.8187495Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:53:53.8189243Z 2025-05-07T19:53:53.8190678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.8192440Z 2025-05-07T19:53:53.8194123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.8195870Z 2025-05-07T19:53:53.8197412Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.8199242Z 2025-05-07T19:53:53.8201031Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:53:53.8202699Z 2025-05-07T19:54:00.5565849Z [135/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o 2025-05-07T19:54:00.5577908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.5578906Z 2025-05-07T19:54:00.5579775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.5580767Z 2025-05-07T19:54:00.5581616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.5582583Z 2025-05-07T19:54:00.5583470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.5584472Z 2025-05-07T19:54:00.5585341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.5586320Z 2025-05-07T19:54:00.5587352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.5588348Z 2025-05-07T19:54:00.7514558Z [136/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o 2025-05-07T19:54:00.7537701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.7539555Z 2025-05-07T19:54:00.7541199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.7543033Z 2025-05-07T19:54:00.7544671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.7546469Z 2025-05-07T19:54:00.7548119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.7550048Z 2025-05-07T19:54:00.7554267Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.7556294Z 2025-05-07T19:54:00.7557959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:00.7559838Z 2025-05-07T19:54:01.9838285Z [137/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o 2025-05-07T19:54:01.9850357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.9851336Z 2025-05-07T19:54:01.9852223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.9853206Z 2025-05-07T19:54:01.9854072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.9855056Z 2025-05-07T19:54:01.9855913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.9856893Z 2025-05-07T19:54:01.9857904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.9858874Z 2025-05-07T19:54:01.9859757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:01.9860737Z 2025-05-07T19:54:04.5697395Z [138/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o 2025-05-07T19:54:04.5721326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:04.5723296Z 2025-05-07T19:54:04.5725063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:04.5727097Z 2025-05-07T19:54:04.5728851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:04.5730837Z 2025-05-07T19:54:04.5733127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:04.5735133Z 2025-05-07T19:54:04.5736862Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:04.5738802Z 2025-05-07T19:54:04.5740579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:04.5742708Z 2025-05-07T19:54:07.4981163Z [139/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_utils_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -MD -MT CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/split_embeddings_utils/radix_sort_pairs.cu -o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o 2025-05-07T19:54:07.5000916Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.5002659Z 2025-05-07T19:54:07.5004214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.5005989Z 2025-05-07T19:54:07.5007505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.5009297Z 2025-05-07T19:54:07.5011362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.5013069Z 2025-05-07T19:54:07.5014597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.5016306Z 2025-05-07T19:54:07.5017849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:07.5019832Z 2025-05-07T19:54:08.1729966Z [140/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_utils.so -o fbgemm_gpu_tbe_utils.so CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/split_embeddings_utils.cpp.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/generate_vbe_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/get_infos_metadata.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/radix_sort_pairs.cu.o CMakeFiles/fbgemm_gpu_tbe_utils.dir/src/split_embeddings_utils/transpose_embedding_input.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:54:08.8209939Z [141/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_sparse_async_cumsum.so -o fbgemm_gpu_sparse_async_cumsum.so CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cpp.o CMakeFiles/fbgemm_gpu_sparse_async_cumsum.dir/src/sparse_ops/sparse_async_cumsum.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:54:09.6078453Z [142/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o 2025-05-07T19:54:09.6100917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:09.6102756Z 2025-05-07T19:54:09.6104461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:09.6106657Z 2025-05-07T19:54:09.6108273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:09.6110159Z 2025-05-07T19:54:09.6111809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:09.6113782Z 2025-05-07T19:54:09.6115422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:09.6117232Z 2025-05-07T19:54:09.6118892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:09.6120807Z 2025-05-07T19:54:17.6198465Z [143/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:17.6218747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.6220531Z 2025-05-07T19:54:17.6222493Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.6224394Z 2025-05-07T19:54:17.6226086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.6228004Z 2025-05-07T19:54:17.6229685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.6231829Z 2025-05-07T19:54:17.6233566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.6235410Z 2025-05-07T19:54:17.6237054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.6238743Z 2025-05-07T19:54:17.8204249Z [144/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o 2025-05-07T19:54:17.8227484Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.8229854Z 2025-05-07T19:54:17.8231928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.8233997Z 2025-05-07T19:54:17.8235737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.8237673Z 2025-05-07T19:54:17.8239416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.8241400Z 2025-05-07T19:54:17.8243160Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.8245141Z 2025-05-07T19:54:17.8246893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:17.8248882Z 2025-05-07T19:54:24.8849854Z [145/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:24.8873101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.8875073Z 2025-05-07T19:54:24.8876850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.8878847Z 2025-05-07T19:54:24.8880649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.8882634Z 2025-05-07T19:54:24.8884372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.8886274Z 2025-05-07T19:54:24.8887980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.8889958Z 2025-05-07T19:54:24.8891718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:24.8893645Z 2025-05-07T19:54:28.9814339Z [146/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o 2025-05-07T19:54:28.9836463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.9838387Z 2025-05-07T19:54:28.9840105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.9842007Z 2025-05-07T19:54:28.9843666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.9845496Z 2025-05-07T19:54:28.9847141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.9848905Z 2025-05-07T19:54:28.9850580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.9852528Z 2025-05-07T19:54:28.9854222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:28.9856106Z 2025-05-07T19:54:39.3796699Z [147/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_adagrad_pt2_autograd.cpp 2025-05-07T19:54:39.3815648Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:39.7775662Z [148/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o 2025-05-07T19:54:39.7798908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.7800846Z 2025-05-07T19:54:39.7802527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.7804522Z 2025-05-07T19:54:39.7806166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.7808275Z 2025-05-07T19:54:39.7809986Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.7811897Z 2025-05-07T19:54:39.7813561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.7815412Z 2025-05-07T19:54:39.7817276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.7819250Z 2025-05-07T19:54:39.8253137Z [149/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o 2025-05-07T19:54:39.8277288Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.8279299Z 2025-05-07T19:54:39.8281054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.8283439Z 2025-05-07T19:54:39.8285119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.8287027Z 2025-05-07T19:54:39.8288711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.8290540Z 2025-05-07T19:54:39.8292209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.8294238Z 2025-05-07T19:54:39.8295930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.8297823Z 2025-05-07T19:54:39.8299575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.8301581Z 2025-05-07T19:54:39.8303272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.8305172Z 2025-05-07T19:54:39.8306792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.8308688Z 2025-05-07T19:54:39.8310341Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.8312250Z 2025-05-07T19:54:39.8314139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.8316079Z 2025-05-07T19:54:39.8317828Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.8319791Z 2025-05-07T19:54:39.8321420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.8323280Z 2025-05-07T19:54:39.8324894Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.8326590Z 2025-05-07T19:54:39.8328271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:39.8330208Z 2025-05-07T19:54:39.9826134Z [150/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:54:39.9848912Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9850854Z 2025-05-07T19:54:39.9852457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9854428Z 2025-05-07T19:54:39.9856036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9857872Z 2025-05-07T19:54:39.9859523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9861487Z 2025-05-07T19:54:39.9863199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9865097Z 2025-05-07T19:54:39.9866798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:39.9868744Z 2025-05-07T19:54:40.1817799Z [151/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:40.1842569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.1844580Z 2025-05-07T19:54:40.1846385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.1848393Z 2025-05-07T19:54:40.1850156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.1852142Z 2025-05-07T19:54:40.1853910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.1855912Z 2025-05-07T19:54:40.1857669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.1859638Z 2025-05-07T19:54:40.1861425Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:40.1863809Z 2025-05-07T19:54:42.0871319Z [152/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o 2025-05-07T19:54:42.0894584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.0896596Z 2025-05-07T19:54:42.0898322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.0900291Z 2025-05-07T19:54:42.0901624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.0906358Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.0909861Z (955): here 2025-05-07T19:54:42.0910089Z 2025-05-07T19:54:42.0911878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.0916742Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.0920158Z (1007): here 2025-05-07T19:54:42.0920392Z 2025-05-07T19:54:42.0925218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.0929878Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.0933488Z (1059): here 2025-05-07T19:54:42.0933720Z 2025-05-07T19:54:42.0934933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.0939383Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.0942708Z (1111): here 2025-05-07T19:54:42.0942920Z 2025-05-07T19:54:42.0944189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.0948665Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.0952165Z (1163): here 2025-05-07T19:54:42.0952395Z 2025-05-07T19:54:42.0953764Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.0958198Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.0961379Z (1215): here 2025-05-07T19:54:42.0961604Z 2025-05-07T19:54:42.0962854Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.0967717Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.0971100Z (1267): here 2025-05-07T19:54:42.0971312Z 2025-05-07T19:54:42.0972608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.0977305Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.0980781Z (1319): here 2025-05-07T19:54:42.0981016Z 2025-05-07T19:54:42.0982357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.0987006Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.0990378Z (1371): here 2025-05-07T19:54:42.0990598Z 2025-05-07T19:54:42.0991925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.0996782Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1000227Z (1423): here 2025-05-07T19:54:42.1000455Z 2025-05-07T19:54:42.1001745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1006375Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1009887Z (1475): here 2025-05-07T19:54:42.1010122Z 2025-05-07T19:54:42.1011476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1016351Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1019936Z (1527): here 2025-05-07T19:54:42.1020179Z 2025-05-07T19:54:42.1021542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1026449Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1029931Z (1579): here 2025-05-07T19:54:42.1030153Z 2025-05-07T19:54:42.1031676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1036101Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1039697Z (1631): here 2025-05-07T19:54:42.1039935Z 2025-05-07T19:54:42.1041295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1045633Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1049003Z (1683): here 2025-05-07T19:54:42.1049227Z 2025-05-07T19:54:42.1050587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1055354Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1058671Z (1735): here 2025-05-07T19:54:42.1058909Z 2025-05-07T19:54:42.1060093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1065081Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1068579Z (1787): here 2025-05-07T19:54:42.1068802Z 2025-05-07T19:54:42.1070107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1075047Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1078541Z (1839): here 2025-05-07T19:54:42.1078761Z 2025-05-07T19:54:42.1080073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1084684Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1088015Z (1891): here 2025-05-07T19:54:42.1088240Z 2025-05-07T19:54:42.1089600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1094407Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1097984Z (1943): here 2025-05-07T19:54:42.1098223Z 2025-05-07T19:54:42.1099577Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1104200Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1107567Z (1995): here 2025-05-07T19:54:42.1107770Z 2025-05-07T19:54:42.1108981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1114205Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1117674Z (2047): here 2025-05-07T19:54:42.1117901Z 2025-05-07T19:54:42.1119265Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1124110Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1127857Z (2099): here 2025-05-07T19:54:42.1128080Z 2025-05-07T19:54:42.1129453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1134510Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1137749Z (2151): here 2025-05-07T19:54:42.1137976Z 2025-05-07T19:54:42.1139768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.1141779Z 2025-05-07T19:54:42.1143584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.1145608Z 2025-05-07T19:54:42.1146981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1151493Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1154634Z (955): here 2025-05-07T19:54:42.1154851Z 2025-05-07T19:54:42.1156122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1160664Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1164491Z (1007): here 2025-05-07T19:54:42.1164723Z 2025-05-07T19:54:42.1166044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1170647Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1174120Z (1059): here 2025-05-07T19:54:42.1174359Z 2025-05-07T19:54:42.1175668Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1180298Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1183550Z (1111): here 2025-05-07T19:54:42.1183774Z 2025-05-07T19:54:42.1185038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1189590Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1192935Z (1163): here 2025-05-07T19:54:42.1193120Z 2025-05-07T19:54:42.1194400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1198954Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1202337Z (1215): here 2025-05-07T19:54:42.1202557Z 2025-05-07T19:54:42.1203892Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1208480Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1211935Z (1267): here 2025-05-07T19:54:42.1212160Z 2025-05-07T19:54:42.1213712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1218270Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1221926Z (1319): here 2025-05-07T19:54:42.1222173Z 2025-05-07T19:54:42.1223467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1227769Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1231450Z (1371): here 2025-05-07T19:54:42.1231673Z 2025-05-07T19:54:42.1232967Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1237359Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1240597Z (1423): here 2025-05-07T19:54:42.1240798Z 2025-05-07T19:54:42.1242052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1246508Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1249815Z (1475): here 2025-05-07T19:54:42.1250018Z 2025-05-07T19:54:42.1251156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1255932Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1259484Z (1527): here 2025-05-07T19:54:42.1259719Z 2025-05-07T19:54:42.1261494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1266149Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1269605Z (1579): here 2025-05-07T19:54:42.1269815Z 2025-05-07T19:54:42.1271082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1275774Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1279179Z (1631): here 2025-05-07T19:54:42.1279399Z 2025-05-07T19:54:42.1280722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1285428Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1288792Z (1683): here 2025-05-07T19:54:42.1289000Z 2025-05-07T19:54:42.1290317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1294120Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1297201Z (1735): here 2025-05-07T19:54:42.1297418Z 2025-05-07T19:54:42.1298583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1302849Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1306322Z (1787): here 2025-05-07T19:54:42.1306560Z 2025-05-07T19:54:42.1308147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1313152Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1316746Z (1839): here 2025-05-07T19:54:42.1317121Z 2025-05-07T19:54:42.1318476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1323206Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1326718Z (1891): here 2025-05-07T19:54:42.1326937Z 2025-05-07T19:54:42.1328306Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1333292Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1336880Z (1943): here 2025-05-07T19:54:42.1337112Z 2025-05-07T19:54:42.1338458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1343211Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1346750Z (1995): here 2025-05-07T19:54:42.1346988Z 2025-05-07T19:54:42.1348357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1353342Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1356886Z (2047): here 2025-05-07T19:54:42.1357105Z 2025-05-07T19:54:42.1358782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1363601Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1367129Z (2099): here 2025-05-07T19:54:42.1367354Z 2025-05-07T19:54:42.1368718Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1373691Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1376318Z (2151): here 2025-05-07T19:54:42.1376496Z 2025-05-07T19:54:42.1378012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.1379829Z 2025-05-07T19:54:42.1381454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:42.1383181Z 2025-05-07T19:54:42.1384363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1388829Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1392335Z (955): here 2025-05-07T19:54:42.1392549Z 2025-05-07T19:54:42.1394024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1398629Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1402070Z (1007): here 2025-05-07T19:54:42.1402287Z 2025-05-07T19:54:42.1403605Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1408487Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1411839Z (1059): here 2025-05-07T19:54:42.1412071Z 2025-05-07T19:54:42.1413385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1417963Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1421147Z (1111): here 2025-05-07T19:54:42.1421361Z 2025-05-07T19:54:42.1422582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1426609Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1429984Z (1163): here 2025-05-07T19:54:42.1430206Z 2025-05-07T19:54:42.1431801Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1436488Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1439892Z (1215): here 2025-05-07T19:54:42.1440121Z 2025-05-07T19:54:42.1441437Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1446018Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1449376Z (1267): here 2025-05-07T19:54:42.1449591Z 2025-05-07T19:54:42.1450897Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1455733Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1459216Z (1319): here 2025-05-07T19:54:42.1459452Z 2025-05-07T19:54:42.1460772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1465406Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1469038Z (1371): here 2025-05-07T19:54:42.1469262Z 2025-05-07T19:54:42.1470584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1475343Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1478752Z (1423): here 2025-05-07T19:54:42.1478979Z 2025-05-07T19:54:42.1480284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1484871Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1488253Z (1475): here 2025-05-07T19:54:42.1488457Z 2025-05-07T19:54:42.1489744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1494137Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1497423Z (1527): here 2025-05-07T19:54:42.1497660Z 2025-05-07T19:54:42.1498966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1503562Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1506973Z (1579): here 2025-05-07T19:54:42.1507185Z 2025-05-07T19:54:42.1508475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1513149Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1516455Z (1631): here 2025-05-07T19:54:42.1516646Z 2025-05-07T19:54:42.1517677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1521172Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1523848Z (1683): here 2025-05-07T19:54:42.1524032Z 2025-05-07T19:54:42.1525043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1528711Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1531774Z (1735): here 2025-05-07T19:54:42.1531976Z 2025-05-07T19:54:42.1533029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1536795Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1539539Z (1787): here 2025-05-07T19:54:42.1539731Z 2025-05-07T19:54:42.1540690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1544595Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1547647Z (1839): here 2025-05-07T19:54:42.1547845Z 2025-05-07T19:54:42.1551750Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1555891Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1559093Z (1891): here 2025-05-07T19:54:42.1559287Z 2025-05-07T19:54:42.1560386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1564373Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1567354Z (1943): here 2025-05-07T19:54:42.1567547Z 2025-05-07T19:54:42.1568663Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1572449Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1575130Z (1995): here 2025-05-07T19:54:42.1575337Z 2025-05-07T19:54:42.1576534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1580922Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1584188Z (2047): here 2025-05-07T19:54:42.1584376Z 2025-05-07T19:54:42.1585580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1590021Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1593391Z (2099): here 2025-05-07T19:54:42.1593591Z 2025-05-07T19:54:42.1594949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_v2_kernel.cu(895): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:54:42.1599264Z detected during instantiation of "void split_embedding_codegen_forward_weighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const float *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:54:42.1602589Z (2151): here 2025-05-07T19:54:42.1602773Z 2025-05-07T19:54:43.2024769Z [153/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:43.2048052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.2049971Z 2025-05-07T19:54:43.2051685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.2053553Z 2025-05-07T19:54:43.2055679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.2057557Z 2025-05-07T19:54:43.2059192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.2061073Z 2025-05-07T19:54:43.2062670Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.2064573Z 2025-05-07T19:54:43.2066542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:43.2068452Z 2025-05-07T19:54:44.8201663Z [154/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o 2025-05-07T19:54:44.8225327Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:44.8227305Z 2025-05-07T19:54:44.8228978Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:44.8230823Z 2025-05-07T19:54:44.8233180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.8234932Z 2025-05-07T19:54:44.8236552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.8238351Z 2025-05-07T19:54:44.8239909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.8241927Z 2025-05-07T19:54:44.8243407Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.8245211Z 2025-05-07T19:54:44.8246891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:44.8248781Z 2025-05-07T19:54:44.8250481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:44.8252383Z 2025-05-07T19:54:44.8253961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.8255773Z 2025-05-07T19:54:44.8257364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.8259127Z 2025-05-07T19:54:44.8260714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.8262505Z 2025-05-07T19:54:44.8264098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.8265917Z 2025-05-07T19:54:44.8267580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:44.8269471Z 2025-05-07T19:54:44.8271152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:44.8273195Z 2025-05-07T19:54:44.8274782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(250): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.8276509Z 2025-05-07T19:54:44.8278036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(255): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.8279790Z 2025-05-07T19:54:44.8281565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(272): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.8283307Z 2025-05-07T19:54:44.8284846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu(284): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:44.8286611Z 2025-05-07T19:54:45.2976015Z [155/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o 2025-05-07T19:54:45.2999435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.3001218Z 2025-05-07T19:54:45.3002923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.3004869Z 2025-05-07T19:54:45.3006503Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.3008375Z 2025-05-07T19:54:45.3010424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.3012234Z 2025-05-07T19:54:45.3013757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.3015596Z 2025-05-07T19:54:45.3017318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.3019290Z 2025-05-07T19:54:45.3020960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.3022728Z 2025-05-07T19:54:45.3024216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.3026003Z 2025-05-07T19:54:45.3027598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.3029354Z 2025-05-07T19:54:45.3030990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.3033264Z 2025-05-07T19:54:45.3034943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.3036753Z 2025-05-07T19:54:45.3038419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:45.3040304Z 2025-05-07T19:54:45.3041929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(216): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.3043756Z 2025-05-07T19:54:45.3045374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(233): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.3047228Z 2025-05-07T19:54:45.3048861Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu(243): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:45.3050713Z 2025-05-07T19:54:47.6615798Z [156/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o 2025-05-07T19:54:47.6639070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.6640943Z 2025-05-07T19:54:47.6642656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.6644568Z 2025-05-07T19:54:47.6646127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.6647927Z 2025-05-07T19:54:47.6649461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.6650859Z 2025-05-07T19:54:47.6652157Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.6653609Z 2025-05-07T19:54:47.6654904Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.6656392Z 2025-05-07T19:54:47.6657811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.6659668Z 2025-05-07T19:54:47.6661270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.6663407Z 2025-05-07T19:54:47.6664950Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.6666638Z 2025-05-07T19:54:47.6667932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.6669494Z 2025-05-07T19:54:47.6670799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.6672545Z 2025-05-07T19:54:47.6674074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.6675698Z 2025-05-07T19:54:47.6677156Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.6678850Z 2025-05-07T19:54:47.6680466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:47.6682328Z 2025-05-07T19:54:47.6683808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(266): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.6685480Z 2025-05-07T19:54:47.6686867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(271): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.6688619Z 2025-05-07T19:54:47.6690142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.6691978Z 2025-05-07T19:54:47.6693583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu(300): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:47.6695339Z 2025-05-07T19:54:49.4890704Z [157/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o 2025-05-07T19:54:49.4911826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.4913832Z 2025-05-07T19:54:49.4915403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.4917225Z 2025-05-07T19:54:49.4918638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.4920353Z 2025-05-07T19:54:49.4921951Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.4923887Z 2025-05-07T19:54:49.4925567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.4927377Z 2025-05-07T19:54:49.4929041Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:49.4930929Z 2025-05-07T19:54:50.2776330Z [158/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o 2025-05-07T19:54:50.2801735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.2803587Z 2025-05-07T19:54:50.2805243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.2807095Z 2025-05-07T19:54:50.2808708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.2810471Z 2025-05-07T19:54:50.2811748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.2813500Z 2025-05-07T19:54:50.2815013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.2816669Z 2025-05-07T19:54:50.2818273Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:50.2819988Z 2025-05-07T19:54:51.9620955Z [159/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o 2025-05-07T19:54:51.9644625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9646568Z 2025-05-07T19:54:51.9648254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9650179Z 2025-05-07T19:54:51.9651837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9653740Z 2025-05-07T19:54:51.9655462Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9657365Z 2025-05-07T19:54:51.9659034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9660953Z 2025-05-07T19:54:51.9662628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:51.9664538Z 2025-05-07T19:54:52.1239906Z [160/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_sgd_pt2_autograd.cpp 2025-05-07T19:54:52.1259314Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:52.3004212Z [161/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o 2025-05-07T19:54:52.3026870Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.3028817Z 2025-05-07T19:54:52.3030228Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.3032280Z 2025-05-07T19:54:52.3033885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.3035904Z 2025-05-07T19:54:52.3037497Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.3039438Z 2025-05-07T19:54:52.3041020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.3042779Z 2025-05-07T19:54:52.3044406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:52.3046288Z 2025-05-07T19:54:56.1124880Z [162/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:56.1145187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.1146877Z 2025-05-07T19:54:56.1148364Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.1150430Z 2025-05-07T19:54:56.1151731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.1153381Z 2025-05-07T19:54:56.1154817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.1156478Z 2025-05-07T19:54:56.1157919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.1159518Z 2025-05-07T19:54:56.1161014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:56.1162670Z 2025-05-07T19:54:56.8547826Z [163/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_approx_sgd_pt2_autograd.cpp 2025-05-07T19:54:56.8564415Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.0057070Z [164/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:54:57.0077643Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.0079084Z 2025-05-07T19:54:57.0080607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.0082384Z 2025-05-07T19:54:57.0083880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.0085552Z 2025-05-07T19:54:57.0086954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.0088604Z 2025-05-07T19:54:57.0090186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.0091981Z 2025-05-07T19:54:57.0093883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.0095704Z 2025-05-07T19:54:57.0395316Z [165/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:57.0413317Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.0735903Z [166/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:57.0753783Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.1076009Z [167/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:57.1438651Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.1459045Z [168/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o 2025-05-07T19:54:57.1479761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.1481454Z 2025-05-07T19:54:57.1482999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.1484723Z 2025-05-07T19:54:57.1486123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1487710Z 2025-05-07T19:54:57.1489091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1490708Z 2025-05-07T19:54:57.1492088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1493748Z 2025-05-07T19:54:57.1494935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1496265Z 2025-05-07T19:54:57.1497721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.1499349Z 2025-05-07T19:54:57.1500872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.1502460Z 2025-05-07T19:54:57.1503839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1505740Z 2025-05-07T19:54:57.1507154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1508738Z 2025-05-07T19:54:57.1510155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1511707Z 2025-05-07T19:54:57.1513250Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1514882Z 2025-05-07T19:54:57.1516032Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.1517421Z 2025-05-07T19:54:57.1518911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:57.1520423Z 2025-05-07T19:54:57.1521612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1523296Z 2025-05-07T19:54:57.1524829Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1526426Z 2025-05-07T19:54:57.1527836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1529291Z 2025-05-07T19:54:57.1530612Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:57.1532636Z 2025-05-07T19:54:57.1549281Z [169/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp 2025-05-07T19:54:57.1567655Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.1781382Z [170/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp 2025-05-07T19:54:57.1800037Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.1819222Z [171/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp 2025-05-07T19:54:57.1837216Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.2127148Z [172/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:54:57.2145961Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.2165719Z [173/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:57.2184430Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.2468874Z [174/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:57.2487990Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.2507349Z [175/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp 2025-05-07T19:54:57.2526745Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.2773020Z [176/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:57.2804774Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.2822174Z [177/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp 2025-05-07T19:54:57.2841148Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:57.3194755Z [178/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:54:57.3212854Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:58.2350745Z [179/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:58.2369139Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:58.7577269Z [180/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o 2025-05-07T19:54:58.7601680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.7603449Z 2025-05-07T19:54:58.7605134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.7606752Z 2025-05-07T19:54:58.7608368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.7610137Z 2025-05-07T19:54:58.7611767Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.7613478Z 2025-05-07T19:54:58.7615370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.7617070Z 2025-05-07T19:54:58.7618581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.7620254Z 2025-05-07T19:54:58.7621822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.7623754Z 2025-05-07T19:54:58.7625357Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.7627158Z 2025-05-07T19:54:58.7628596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.7630372Z 2025-05-07T19:54:58.7632275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.7634232Z 2025-05-07T19:54:58.7635824Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.7637692Z 2025-05-07T19:54:58.7639303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.7641109Z 2025-05-07T19:54:58.7642806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.7644716Z 2025-05-07T19:54:58.7646423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:54:58.7648349Z 2025-05-07T19:54:58.7649863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.7651566Z 2025-05-07T19:54:58.7653100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.7654767Z 2025-05-07T19:54:58.7656212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.7657886Z 2025-05-07T19:54:58.7659518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:54:58.7661376Z 2025-05-07T19:54:58.7844201Z [181/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:54:58.7865024Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:54:59.4980525Z [182/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:54:59.5001274Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:00.4448741Z [183/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:55:00.4468141Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.0494862Z [184/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_lars_sgd_pt2_autograd.cpp 2025-05-07T19:55:01.0514956Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:01.4756314Z [185/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp 2025-05-07T19:55:01.4770735Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.1501454Z [186/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_none_pt2_autograd.cpp 2025-05-07T19:55:02.1519760Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.3748112Z [187/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_lamb_pt2_autograd.cpp 2025-05-07T19:55:02.3766380Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.4093772Z [188/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:55:02.4112787Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.4433484Z [189/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:02.4452468Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.4849385Z [190/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp 2025-05-07T19:55:02.4867518Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.4886405Z [191/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:55:02.4905039Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.5216471Z [192/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:55:02.5236226Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.5255221Z [193/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp 2025-05-07T19:55:02.5274535Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.5580018Z [194/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:02.5598618Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.7833938Z [195/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:55:02.7855313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.7857097Z 2025-05-07T19:55:02.7858704Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.7860440Z 2025-05-07T19:55:02.7861969Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.7863605Z 2025-05-07T19:55:02.7865181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.7866971Z 2025-05-07T19:55:02.7868775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.7870490Z 2025-05-07T19:55:02.7871938Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:02.7873864Z 2025-05-07T19:55:02.8501563Z [196/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp 2025-05-07T19:55:02.8520842Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:02.9307752Z [197/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp 2025-05-07T19:55:02.9326744Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:03.1045225Z [198/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o 2025-05-07T19:55:03.1068122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.1069990Z 2025-05-07T19:55:03.1075205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.1077255Z 2025-05-07T19:55:03.1078748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1080457Z 2025-05-07T19:55:03.1081988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1084064Z 2025-05-07T19:55:03.1085617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1087358Z 2025-05-07T19:55:03.1088988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.1090773Z 2025-05-07T19:55:03.1092422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.1094322Z 2025-05-07T19:55:03.1095934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1097433Z 2025-05-07T19:55:03.1098830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1100538Z 2025-05-07T19:55:03.1102081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1103850Z 2025-05-07T19:55:03.1105408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.1107186Z 2025-05-07T19:55:03.1108796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:03.1110615Z 2025-05-07T19:55:03.1112208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(219): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1114107Z 2025-05-07T19:55:03.1115650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(236): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1117406Z 2025-05-07T19:55:03.1118997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu(246): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:03.1120821Z 2025-05-07T19:55:04.7865515Z [199/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:55:04.7886251Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:05.8668254Z [200/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o 2025-05-07T19:55:05.8692202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8694576Z 2025-05-07T19:55:05.8696397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8698364Z 2025-05-07T19:55:05.8700063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8702049Z 2025-05-07T19:55:05.8703814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8705621Z 2025-05-07T19:55:05.8707343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8709348Z 2025-05-07T19:55:05.8711135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.8713253Z 2025-05-07T19:55:05.9152873Z [201/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o 2025-05-07T19:55:05.9176340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.9177966Z 2025-05-07T19:55:05.9179296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.9180680Z 2025-05-07T19:55:05.9181925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.9183401Z 2025-05-07T19:55:05.9184835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.9186537Z 2025-05-07T19:55:05.9188034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.9189649Z 2025-05-07T19:55:05.9191110Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:05.9193069Z 2025-05-07T19:55:08.7092916Z [202/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp 2025-05-07T19:55:08.7109985Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:08.9941327Z [203/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_adam_pt2_autograd.cpp 2025-05-07T19:55:08.9961099Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:09.4558481Z [204/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp 2025-05-07T19:55:09.4575788Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:12.7613901Z [205/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:12.7634956Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:14.7048121Z [206/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp 2025-05-07T19:55:14.7065288Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:14.8059246Z [207/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o 2025-05-07T19:55:14.8080000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.8081723Z 2025-05-07T19:55:14.8083016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.8085184Z 2025-05-07T19:55:14.8086843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.8088580Z 2025-05-07T19:55:14.8090023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.8091571Z 2025-05-07T19:55:14.8092910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.8094500Z 2025-05-07T19:55:14.8095913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:14.8097566Z 2025-05-07T19:55:14.9382475Z [208/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:55:14.9399810Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:15.1701492Z [209/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp 2025-05-07T19:55:15.4921124Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:15.4936857Z [210/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_cpu.cpp 2025-05-07T19:55:15.4955182Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:16.1419795Z [211/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:55:16.1437112Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.3070291Z [212/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp 2025-05-07T19:55:17.3087938Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.3613516Z [213/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lars_sgd_cpu.cpp 2025-05-07T19:55:17.3630231Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.3928402Z [214/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lamb_cpu.cpp 2025-05-07T19:55:17.3945512Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.4076050Z [215/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adam_cpu.cpp 2025-05-07T19:55:17.4094193Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.5186404Z [216/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:17.5203567Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.5929034Z [217/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:17.5946493Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:17.7634573Z [218/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o 2025-05-07T19:55:17.7664475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.7666863Z 2025-05-07T19:55:17.7668908Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.7671274Z 2025-05-07T19:55:17.7673468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.7675787Z 2025-05-07T19:55:17.7678133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.7680628Z 2025-05-07T19:55:17.7682830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:17.7685179Z 2025-05-07T19:55:17.7687290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:18.0871953Z 2025-05-07T19:55:18.0893468Z [219/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp 2025-05-07T19:55:18.0914044Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:18.2344747Z [220/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp 2025-05-07T19:55:18.2364909Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:18.2860769Z [221/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp 2025-05-07T19:55:18.2881889Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:18.4828192Z [222/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/backward/embedding_backward_dense_host_cpu.cpp 2025-05-07T19:55:18.4853374Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:19.8898906Z [223/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adagrad_cpu.cpp 2025-05-07T19:55:19.8919375Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:19.9120896Z [224/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_sgd_cpu.cpp 2025-05-07T19:55:19.9141074Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:20.0791624Z [225/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:20.0806999Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:20.4103118Z [226/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp 2025-05-07T19:55:20.4123550Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:20.4255120Z [227/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp 2025-05-07T19:55:20.4271615Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:21.3888820Z [228/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_none_cpu.cpp 2025-05-07T19:55:21.3905572Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:21.6834351Z [229/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o 2025-05-07T19:55:21.6858299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:21.6860287Z 2025-05-07T19:55:21.6862021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:21.6863884Z 2025-05-07T19:55:21.6865529Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.6867415Z 2025-05-07T19:55:21.6868972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.6870827Z 2025-05-07T19:55:21.6872440Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.6874488Z 2025-05-07T19:55:21.6876189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:21.6878133Z 2025-05-07T19:55:21.6879961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:21.6881820Z 2025-05-07T19:55:21.6883697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.6885640Z 2025-05-07T19:55:21.6887223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.6889073Z 2025-05-07T19:55:21.6890651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.6892474Z 2025-05-07T19:55:21.6894394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:21.6896344Z 2025-05-07T19:55:21.6898097Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:21.6899962Z 2025-05-07T19:55:21.6901607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.6903433Z 2025-05-07T19:55:21.6905068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.6906908Z 2025-05-07T19:55:21.6908559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:21.6910420Z 2025-05-07T19:55:23.8062646Z [230/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp 2025-05-07T19:55:23.8083414Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:24.3619489Z [231/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:24.3640567Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:26.1603390Z [232/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_sgd_cpu.cpp 2025-05-07T19:55:26.1620744Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:26.2217414Z [233/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp 2025-05-07T19:55:26.2236554Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:26.5776969Z [234/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp 2025-05-07T19:55:26.5796607Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:26.5967562Z [235/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:26.5986821Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:26.9867239Z [236/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp 2025-05-07T19:55:26.9887683Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:27.6829902Z [237/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp 2025-05-07T19:55:27.6846364Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:28.7668925Z [238/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_pt2_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp 2025-05-07T19:55:28.7687022Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:29.0946831Z [239/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o 2025-05-07T19:55:29.0966059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0967611Z 2025-05-07T19:55:29.0968965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0970545Z 2025-05-07T19:55:29.0971909Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0973430Z 2025-05-07T19:55:29.0974699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0976222Z 2025-05-07T19:55:29.0977678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0979172Z 2025-05-07T19:55:29.0980475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0982001Z 2025-05-07T19:55:29.0983397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0985026Z 2025-05-07T19:55:29.0986568Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.0988078Z 2025-05-07T19:55:29.0989436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0990896Z 2025-05-07T19:55:29.0992397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0994096Z 2025-05-07T19:55:29.0995432Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0996963Z 2025-05-07T19:55:29.0998280Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.0999807Z 2025-05-07T19:55:29.1001511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.1003194Z 2025-05-07T19:55:29.1004755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.1006223Z 2025-05-07T19:55:29.1007669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.1009094Z 2025-05-07T19:55:29.1010373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.1011980Z 2025-05-07T19:55:29.1013359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.1014891Z 2025-05-07T19:55:29.1016164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.1017845Z 2025-05-07T19:55:29.3472774Z [240/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o 2025-05-07T19:55:29.3494946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.3496736Z 2025-05-07T19:55:29.3498388Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.3500312Z 2025-05-07T19:55:29.3501765Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.3503292Z 2025-05-07T19:55:29.3504714Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.3506324Z 2025-05-07T19:55:29.3507760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.3509450Z 2025-05-07T19:55:29.3511021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.3512856Z 2025-05-07T19:55:29.3514474Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.3516404Z 2025-05-07T19:55:29.3517739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.3519199Z 2025-05-07T19:55:29.3520711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.3522284Z 2025-05-07T19:55:29.3523549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.3525116Z 2025-05-07T19:55:29.3526505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.3528095Z 2025-05-07T19:55:29.3529776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.3531668Z 2025-05-07T19:55:29.3533215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.3534936Z 2025-05-07T19:55:29.3536597Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:29.3538717Z 2025-05-07T19:55:29.3540049Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(269): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.3541645Z 2025-05-07T19:55:29.3542918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(274): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.3544272Z 2025-05-07T19:55:29.3545629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(291): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.3547454Z 2025-05-07T19:55:29.3548834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu(303): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:29.3550397Z 2025-05-07T19:55:29.5922429Z [241/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_pt2.so -o fbgemm_gpu_tbe_training_backward_pt2.so CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_lars_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_adam_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_partial_rowwise_lamb_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_none_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_sgd_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_counter_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_approx_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_adagrad_with_weight_decay_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_split_rowwise_weighted_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_ssd_rowwise_adagrad_pt2_autograd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_lars_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_adam_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_partial_rowwise_lamb_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_none_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_sgd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_pt2.dir/gen_embedding_backward_ssd_rowwise_adagrad_pt2_cuda_wrapper.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T19:55:29.8899241Z [242/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_cpu.cpp 2025-05-07T19:55:29.8918397Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:30.0776268Z [243/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o 2025-05-07T19:55:30.0797661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0799228Z 2025-05-07T19:55:30.0800654Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0802615Z 2025-05-07T19:55:30.0804177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0805887Z 2025-05-07T19:55:30.0807397Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0809099Z 2025-05-07T19:55:30.0810631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0812389Z 2025-05-07T19:55:30.0814178Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:30.0815667Z 2025-05-07T19:55:30.1817133Z [244/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp 2025-05-07T19:55:30.1836710Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:31.5112029Z [245/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o 2025-05-07T19:55:31.5134708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.5136552Z 2025-05-07T19:55:31.5138001Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.5139776Z 2025-05-07T19:55:31.5141359Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5143115Z 2025-05-07T19:55:31.5144580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5146346Z 2025-05-07T19:55:31.5147866Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5149517Z 2025-05-07T19:55:31.5151040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5152991Z 2025-05-07T19:55:31.5154623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.5156453Z 2025-05-07T19:55:31.5158063Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.5159943Z 2025-05-07T19:55:31.5161431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5163078Z 2025-05-07T19:55:31.5164524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5166225Z 2025-05-07T19:55:31.5167769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5169484Z 2025-05-07T19:55:31.5171340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5173104Z 2025-05-07T19:55:31.5174741Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.5176497Z 2025-05-07T19:55:31.5178169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.5180209Z 2025-05-07T19:55:31.5181734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5183476Z 2025-05-07T19:55:31.5185019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5186763Z 2025-05-07T19:55:31.5188236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5189893Z 2025-05-07T19:55:31.5191382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.5193210Z 2025-05-07T19:55:31.7048390Z [246/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o 2025-05-07T19:55:31.7071657Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7074080Z 2025-05-07T19:55:31.7075757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7077393Z 2025-05-07T19:55:31.7078697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7080309Z 2025-05-07T19:55:31.7081851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7083409Z 2025-05-07T19:55:31.7084816Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7086554Z 2025-05-07T19:55:31.7088112Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7089858Z 2025-05-07T19:55:31.7091476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7093288Z 2025-05-07T19:55:31.7094933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7096801Z 2025-05-07T19:55:31.7098354Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7100111Z 2025-05-07T19:55:31.7101630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7103356Z 2025-05-07T19:55:31.7104859Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7106510Z 2025-05-07T19:55:31.7107543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7108757Z 2025-05-07T19:55:31.7109872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7111225Z 2025-05-07T19:55:31.7112775Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:31.7114117Z 2025-05-07T19:55:31.7115278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(253): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7116613Z 2025-05-07T19:55:31.7117889Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(258): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7119710Z 2025-05-07T19:55:31.7121271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7122744Z 2025-05-07T19:55:31.7124091Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu(287): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:55:31.7125964Z 2025-05-07T19:55:31.7258728Z [247/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_cpu.cpp 2025-05-07T19:55:31.7277973Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:55:33.0784206Z [248/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o 2025-05-07T19:55:33.0808040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.0810003Z 2025-05-07T19:55:33.0811601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.0813363Z 2025-05-07T19:55:33.0814656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0818290Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0820559Z (946): here 2025-05-07T19:55:33.0820768Z 2025-05-07T19:55:33.0821855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0825658Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0828263Z (996): here 2025-05-07T19:55:33.0828455Z 2025-05-07T19:55:33.0829512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0833475Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0836622Z (1046): here 2025-05-07T19:55:33.0836801Z 2025-05-07T19:55:33.0838058Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0841490Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0844218Z (1096): here 2025-05-07T19:55:33.0844396Z 2025-05-07T19:55:33.0845504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0849466Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0852047Z (1146): here 2025-05-07T19:55:33.0852210Z 2025-05-07T19:55:33.0853159Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0856742Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0859525Z (1196): here 2025-05-07T19:55:33.0859749Z 2025-05-07T19:55:33.0861022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0865510Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0868882Z (1246): here 2025-05-07T19:55:33.0869108Z 2025-05-07T19:55:33.0870678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0874838Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0877585Z (1296): here 2025-05-07T19:55:33.0877778Z 2025-05-07T19:55:33.0878949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0883055Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0886062Z (1346): here 2025-05-07T19:55:33.0886270Z 2025-05-07T19:55:33.0887576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0891845Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0895155Z (1396): here 2025-05-07T19:55:33.0895381Z 2025-05-07T19:55:33.0896579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0900587Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0903943Z (1446): here 2025-05-07T19:55:33.0904160Z 2025-05-07T19:55:33.0905387Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0909672Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0912842Z (1496): here 2025-05-07T19:55:33.0913034Z 2025-05-07T19:55:33.0914311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0918514Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0921699Z (1546): here 2025-05-07T19:55:33.0922069Z 2025-05-07T19:55:33.0923376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0927837Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0931084Z (1596): here 2025-05-07T19:55:33.0931521Z 2025-05-07T19:55:33.0932850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0937167Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0940412Z (1646): here 2025-05-07T19:55:33.0940640Z 2025-05-07T19:55:33.0941949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0946394Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0949178Z (1696): here 2025-05-07T19:55:33.0949374Z 2025-05-07T19:55:33.0950481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0954720Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0957835Z (1746): here 2025-05-07T19:55:33.0958047Z 2025-05-07T19:55:33.0959398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0963770Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0966852Z (1796): here 2025-05-07T19:55:33.0967074Z 2025-05-07T19:55:33.0968444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0972842Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0976035Z (1846): here 2025-05-07T19:55:33.0976256Z 2025-05-07T19:55:33.0977369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0981262Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0984335Z (1896): here 2025-05-07T19:55:33.0984527Z 2025-05-07T19:55:33.0985755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0989948Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.0993323Z (1946): here 2025-05-07T19:55:33.0993517Z 2025-05-07T19:55:33.0994820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.0999076Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1002259Z (1996): here 2025-05-07T19:55:33.1002494Z 2025-05-07T19:55:33.1003686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1008323Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1011116Z (2046): here 2025-05-07T19:55:33.1011311Z 2025-05-07T19:55:33.1012476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1016862Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1020228Z (2096): here 2025-05-07T19:55:33.1020488Z 2025-05-07T19:55:33.1022105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.1023746Z 2025-05-07T19:55:33.1025352Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.1027004Z 2025-05-07T19:55:33.1028251Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1033023Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1036094Z (946): here 2025-05-07T19:55:33.1036297Z 2025-05-07T19:55:33.1037581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1041470Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1044445Z (996): here 2025-05-07T19:55:33.1044645Z 2025-05-07T19:55:33.1045902Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1050344Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1053425Z (1046): here 2025-05-07T19:55:33.1053644Z 2025-05-07T19:55:33.1054906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1059200Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1062460Z (1096): here 2025-05-07T19:55:33.1062674Z 2025-05-07T19:55:33.1063958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1068193Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1071352Z (1146): here 2025-05-07T19:55:33.1071565Z 2025-05-07T19:55:33.1072860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1077027Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1080263Z (1196): here 2025-05-07T19:55:33.1080485Z 2025-05-07T19:55:33.1081747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1086175Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1089414Z (1246): here 2025-05-07T19:55:33.1089646Z 2025-05-07T19:55:33.1090906Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1095352Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1098569Z (1296): here 2025-05-07T19:55:33.1098801Z 2025-05-07T19:55:33.1100297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1104630Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1107597Z (1346): here 2025-05-07T19:55:33.1107789Z 2025-05-07T19:55:33.1108924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1113220Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1116423Z (1396): here 2025-05-07T19:55:33.1116640Z 2025-05-07T19:55:33.1117924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1122294Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1125504Z (1446): here 2025-05-07T19:55:33.1125708Z 2025-05-07T19:55:33.1127005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1130361Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1132881Z (1496): here 2025-05-07T19:55:33.1133116Z 2025-05-07T19:55:33.1134008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1137318Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1139982Z (1546): here 2025-05-07T19:55:33.1140180Z 2025-05-07T19:55:33.1141638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1145563Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1149027Z (1596): here 2025-05-07T19:55:33.1149250Z 2025-05-07T19:55:33.1150665Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1154823Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1157651Z (1646): here 2025-05-07T19:55:33.1157842Z 2025-05-07T19:55:33.1158776Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1162615Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1165080Z (1696): here 2025-05-07T19:55:33.1165240Z 2025-05-07T19:55:33.1166175Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1170001Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1172961Z (1746): here 2025-05-07T19:55:33.1173137Z 2025-05-07T19:55:33.1174104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1178231Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1181785Z (1796): here 2025-05-07T19:55:33.1182011Z 2025-05-07T19:55:33.1183396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1187936Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1191272Z (1846): here 2025-05-07T19:55:33.1191452Z 2025-05-07T19:55:33.1192917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1197360Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1200738Z (1896): here 2025-05-07T19:55:33.1200948Z 2025-05-07T19:55:33.1202133Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1206360Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1209686Z (1946): here 2025-05-07T19:55:33.1209920Z 2025-05-07T19:55:33.1211236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1215788Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1219092Z (1996): here 2025-05-07T19:55:33.1219313Z 2025-05-07T19:55:33.1220325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1223611Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1226864Z (2046): here 2025-05-07T19:55:33.1227086Z 2025-05-07T19:55:33.1228243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1232775Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, __nv_bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1236233Z (2096): here 2025-05-07T19:55:33.1236454Z 2025-05-07T19:55:33.1238226Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.1240357Z 2025-05-07T19:55:33.1242122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:33.1244133Z 2025-05-07T19:55:33.1245515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1250192Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1253620Z (946): here 2025-05-07T19:55:33.1253829Z 2025-05-07T19:55:33.1255194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1259775Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1263174Z (996): here 2025-05-07T19:55:33.1263376Z 2025-05-07T19:55:33.1264726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1269384Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1272848Z (1046): here 2025-05-07T19:55:33.1273052Z 2025-05-07T19:55:33.1274384Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1278937Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1282374Z (1096): here 2025-05-07T19:55:33.1282615Z 2025-05-07T19:55:33.1283970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1288502Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1291847Z (1146): here 2025-05-07T19:55:33.1292071Z 2025-05-07T19:55:33.1293400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1297956Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1301267Z (1196): here 2025-05-07T19:55:33.1301465Z 2025-05-07T19:55:33.1302760Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1307211Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1310554Z (1246): here 2025-05-07T19:55:33.1310751Z 2025-05-07T19:55:33.1311961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1315770Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=float, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1318704Z (1296): here 2025-05-07T19:55:33.1318898Z 2025-05-07T19:55:33.1320166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1324780Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1328120Z (1346): here 2025-05-07T19:55:33.1328357Z 2025-05-07T19:55:33.1329901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1334779Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1338338Z (1396): here 2025-05-07T19:55:33.1338554Z 2025-05-07T19:55:33.1339900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1344455Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1347842Z (1446): here 2025-05-07T19:55:33.1348061Z 2025-05-07T19:55:33.1349376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1354007Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1357317Z (1496): here 2025-05-07T19:55:33.1357534Z 2025-05-07T19:55:33.1358887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1363392Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1366611Z (1546): here 2025-05-07T19:55:33.1366836Z 2025-05-07T19:55:33.1368146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1372667Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1375345Z (1596): here 2025-05-07T19:55:33.1375544Z 2025-05-07T19:55:33.1376606Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1382231Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1385624Z (1646): here 2025-05-07T19:55:33.1385846Z 2025-05-07T19:55:33.1387149Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1391853Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::Half, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1395285Z (1696): here 2025-05-07T19:55:33.1395488Z 2025-05-07T19:55:33.1396809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1401327Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1404666Z (1746): here 2025-05-07T19:55:33.1404880Z 2025-05-07T19:55:33.1406215Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1410754Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1414047Z (1796): here 2025-05-07T19:55:33.1414266Z 2025-05-07T19:55:33.1415598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1420160Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1423436Z (1846): here 2025-05-07T19:55:33.1423644Z 2025-05-07T19:55:33.1424924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1429533Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int32_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1433098Z (1896): here 2025-05-07T19:55:33.1433293Z 2025-05-07T19:55:33.1434426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1438492Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1441894Z (1946): here 2025-05-07T19:55:33.1442115Z 2025-05-07T19:55:33.1443481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1448043Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=float, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1451455Z (1996): here 2025-05-07T19:55:33.1451673Z 2025-05-07T19:55:33.1453005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1457559Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=float, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1460933Z (2046): here 2025-05-07T19:55:33.1461161Z 2025-05-07T19:55:33.1462458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_v2_kernel.cu(873): warning #68-D: integer conversion resulted in a change of sign 2025-05-07T19:55:33.1467115Z detected during instantiation of "void split_embedding_codegen_forward_unweighted_v2_kernel(const emb_t *, const emb_t *, const cache_t *, const int32_t *, uint32_t, uint32_t, bool, uint32_t, fbgemm_gpu::FixedDivisor, const index_t *, const index_t *, const uint32_t *, const int64_t *, const int32_t *, output_t *) [with emb_t=c10::Half, cache_t=c10::Half, output_t=c10::BFloat16, index_t=int64_t, USE_LXU_CACHE=false]" 2025-05-07T19:55:33.1470437Z (2096): here 2025-05-07T19:55:33.1470643Z 2025-05-07T19:55:37.0356133Z [249/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o 2025-05-07T19:55:37.0379297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:37.0381316Z 2025-05-07T19:55:37.0383011Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:37.0384906Z 2025-05-07T19:55:37.0386571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:37.0388437Z 2025-05-07T19:55:37.0390153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:37.0392066Z 2025-05-07T19:55:37.0393865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:37.0395619Z 2025-05-07T19:55:37.0397102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:37.0398670Z 2025-05-07T19:55:41.4680390Z [250/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_dense_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o 2025-05-07T19:55:41.4701628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:41.4703357Z 2025-05-07T19:55:41.4704952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:41.4706783Z 2025-05-07T19:55:41.4708399Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:41.4710254Z 2025-05-07T19:55:41.4711822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:41.4713581Z 2025-05-07T19:55:41.4715223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:41.4716993Z 2025-05-07T19:55:41.4718616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:41.4720522Z 2025-05-07T19:55:54.0760428Z [251/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_grad_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o 2025-05-07T19:55:54.0782247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.0784131Z 2025-05-07T19:55:54.0785778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.0787280Z 2025-05-07T19:55:54.0788596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.0790106Z 2025-05-07T19:55:54.0791464Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.0793147Z 2025-05-07T19:55:54.0794590Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.0796258Z 2025-05-07T19:55:54.0797698Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:55:54.0799235Z 2025-05-07T19:56:29.4202463Z [252/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:56:29.4225742Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.4227757Z 2025-05-07T19:56:29.4229528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.4231743Z 2025-05-07T19:56:29.4233481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.4235286Z 2025-05-07T19:56:29.4236878Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.4238488Z 2025-05-07T19:56:29.4240119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.4241863Z 2025-05-07T19:56:29.4243839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:29.4245651Z 2025-05-07T19:56:31.1504938Z [253/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:56:31.1528780Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1530673Z 2025-05-07T19:56:31.1532635Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1534638Z 2025-05-07T19:56:31.1536369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1538300Z 2025-05-07T19:56:31.1540052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1542019Z 2025-05-07T19:56:31.1543733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1545827Z 2025-05-07T19:56:31.1547477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.1549396Z 2025-05-07T19:56:31.9825130Z [254/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:56:31.9847956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9849810Z 2025-05-07T19:56:31.9851444Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9853233Z 2025-05-07T19:56:31.9854805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9856598Z 2025-05-07T19:56:31.9858134Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9859926Z 2025-05-07T19:56:31.9861911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9863688Z 2025-05-07T19:56:31.9865291Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:31.9867067Z 2025-05-07T19:56:33.0044492Z [255/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:33.0066615Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.0068076Z 2025-05-07T19:56:33.0069550Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.0071399Z 2025-05-07T19:56:33.0073176Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.0074939Z 2025-05-07T19:56:33.0076876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.0078679Z 2025-05-07T19:56:33.0080325Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.0082245Z 2025-05-07T19:56:33.0083676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.0085740Z 2025-05-07T19:56:33.1895158Z [256/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o 2025-05-07T19:56:33.1917209Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.1918969Z 2025-05-07T19:56:33.1920571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.1922303Z 2025-05-07T19:56:33.1924323Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.1926087Z 2025-05-07T19:56:33.1927685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.1929467Z 2025-05-07T19:56:33.1931029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.1933335Z 2025-05-07T19:56:33.1934843Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.1936634Z 2025-05-07T19:56:33.5925524Z [257/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o 2025-05-07T19:56:33.5949585Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.5951497Z 2025-05-07T19:56:33.5953567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.5955546Z 2025-05-07T19:56:33.5957195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:33.5958971Z 2025-05-07T19:56:33.5960496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:33.5962602Z 2025-05-07T19:56:33.5964249Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:33.5966140Z 2025-05-07T19:56:33.5967772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.5969473Z 2025-05-07T19:56:33.5971199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.5972942Z 2025-05-07T19:56:33.5974537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:33.5976294Z 2025-05-07T19:56:33.5977815Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:33.5979560Z 2025-05-07T19:56:33.5981129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:33.5982857Z 2025-05-07T19:56:33.5984446Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.5986295Z 2025-05-07T19:56:33.5988019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.5989962Z 2025-05-07T19:56:33.5991628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(220): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:33.5993627Z 2025-05-07T19:56:33.5995188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(237): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:33.5996935Z 2025-05-07T19:56:33.5998454Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu(247): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:56:33.6000142Z 2025-05-07T19:56:33.9524689Z [258/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o 2025-05-07T19:56:33.9549860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.9551910Z 2025-05-07T19:56:33.9553769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.9555822Z 2025-05-07T19:56:33.9557608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.9559564Z 2025-05-07T19:56:33.9561353Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.9563327Z 2025-05-07T19:56:33.9565050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.9567021Z 2025-05-07T19:56:33.9568755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:33.9570764Z 2025-05-07T19:56:34.9286388Z [259/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:56:34.9310899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.9313144Z 2025-05-07T19:56:34.9314947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.9316962Z 2025-05-07T19:56:34.9318699Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.9320662Z 2025-05-07T19:56:34.9322445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.9324466Z 2025-05-07T19:56:34.9326222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.9327991Z 2025-05-07T19:56:34.9329813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:34.9331915Z 2025-05-07T19:56:35.6353337Z [260/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o 2025-05-07T19:56:35.6378065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6380075Z 2025-05-07T19:56:35.6381806Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6383788Z 2025-05-07T19:56:35.6385539Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6387497Z 2025-05-07T19:56:35.6389292Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6391258Z 2025-05-07T19:56:35.6393518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6395458Z 2025-05-07T19:56:35.6397186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6399142Z 2025-05-07T19:56:35.6508268Z [261/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o 2025-05-07T19:56:35.6533244Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6535243Z 2025-05-07T19:56:35.6537098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6539116Z 2025-05-07T19:56:35.6540873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6542817Z 2025-05-07T19:56:35.6544507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6546456Z 2025-05-07T19:56:35.6548423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6550353Z 2025-05-07T19:56:35.6552094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:35.6554139Z 2025-05-07T19:56:37.0262447Z [262/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:56:37.0287311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.0289115Z 2025-05-07T19:56:37.0290724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.0292541Z 2025-05-07T19:56:37.0294193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.0296040Z 2025-05-07T19:56:37.0297988Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.0299824Z 2025-05-07T19:56:37.0301416Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.0303198Z 2025-05-07T19:56:37.0304804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.0306827Z 2025-05-07T19:56:37.4017534Z [263/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o 2025-05-07T19:56:37.4038674Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.4040598Z 2025-05-07T19:56:37.4042277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.4043814Z 2025-05-07T19:56:37.4044929Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:37.4046313Z 2025-05-07T19:56:37.4048311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.4050027Z 2025-05-07T19:56:37.4051651Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.4053480Z 2025-05-07T19:56:37.4054430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_gwd_kernel.cu(240): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:37.4055847Z 2025-05-07T19:56:37.4057180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.4058662Z 2025-05-07T19:56:37.4059982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.4061493Z 2025-05-07T19:56:37.7291686Z [264/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:37.7313055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.7314603Z 2025-05-07T19:56:37.7316223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.7318162Z 2025-05-07T19:56:37.7319921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.7321896Z 2025-05-07T19:56:37.7323573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.7325406Z 2025-05-07T19:56:37.7327035Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.7328817Z 2025-05-07T19:56:37.7330507Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.7332709Z 2025-05-07T19:56:37.8733173Z [265/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:56:37.8757005Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.8759017Z 2025-05-07T19:56:37.8760769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.8762746Z 2025-05-07T19:56:37.8764751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.8766686Z 2025-05-07T19:56:37.8768225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.8770048Z 2025-05-07T19:56:37.8771676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.8773456Z 2025-05-07T19:56:37.8775190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:37.8777177Z 2025-05-07T19:56:42.5861491Z [266/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:56:42.5884189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.5886088Z 2025-05-07T19:56:42.5887743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.5889968Z 2025-05-07T19:56:42.5891626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.5893483Z 2025-05-07T19:56:42.5895174Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.5897050Z 2025-05-07T19:56:42.5898695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.5900588Z 2025-05-07T19:56:42.5902260Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:42.5904145Z 2025-05-07T19:56:44.1795337Z [267/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:44.1814895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.1816880Z 2025-05-07T19:56:44.1818219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.1819743Z 2025-05-07T19:56:44.1821125Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.1822625Z 2025-05-07T19:56:44.1823997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.1825545Z 2025-05-07T19:56:44.1826899Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.1828431Z 2025-05-07T19:56:44.1829820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:44.1831656Z 2025-05-07T19:56:45.0277250Z [268/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:56:45.0298581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.0300309Z 2025-05-07T19:56:45.0301802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.0303508Z 2025-05-07T19:56:45.0304876Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.0306772Z 2025-05-07T19:56:45.0308219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.0309945Z 2025-05-07T19:56:45.0311420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.0313366Z 2025-05-07T19:56:45.0314952Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.0316673Z 2025-05-07T19:56:45.4369472Z [269/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o 2025-05-07T19:56:45.4391016Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.4392403Z 2025-05-07T19:56:45.4394303Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.4395873Z 2025-05-07T19:56:45.4397066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.4398457Z 2025-05-07T19:56:45.4399831Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.4401618Z 2025-05-07T19:56:45.4403225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.4404661Z 2025-05-07T19:56:45.4405968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.4407649Z 2025-05-07T19:56:45.7382402Z [270/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o 2025-05-07T19:56:45.7405296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.7406955Z 2025-05-07T19:56:45.7408522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.7410348Z 2025-05-07T19:56:45.7411678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.7413233Z 2025-05-07T19:56:45.7414761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.7416515Z 2025-05-07T19:56:45.7418269Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.7420259Z 2025-05-07T19:56:45.7422009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:45.7424007Z 2025-05-07T19:56:46.0584576Z [271/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o 2025-05-07T19:56:46.0609025Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.0610972Z 2025-05-07T19:56:46.0612721Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.0614689Z 2025-05-07T19:56:46.0616426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.0618368Z 2025-05-07T19:56:46.0620121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.0622066Z 2025-05-07T19:56:46.0623710Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.0625525Z 2025-05-07T19:56:46.0627173Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.0628982Z 2025-05-07T19:56:46.3157242Z [272/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o 2025-05-07T19:56:46.3180691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3182718Z 2025-05-07T19:56:46.3184530Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3186569Z 2025-05-07T19:56:46.3188310Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3190264Z 2025-05-07T19:56:46.3191991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3194086Z 2025-05-07T19:56:46.3195821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3197762Z 2025-05-07T19:56:46.3199535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.3201498Z 2025-05-07T19:56:46.6398406Z [273/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:56:46.6420725Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.6422621Z 2025-05-07T19:56:46.6424199Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.6425979Z 2025-05-07T19:56:46.6427492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.6429272Z 2025-05-07T19:56:46.6430858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.6433007Z 2025-05-07T19:56:46.6434573Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.6436475Z 2025-05-07T19:56:46.6437881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:46.6439683Z 2025-05-07T19:56:48.3253566Z [274/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:56:48.3274543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.3276366Z 2025-05-07T19:56:48.3277752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.3279532Z 2025-05-07T19:56:48.3281066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.3282762Z 2025-05-07T19:56:48.3284204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.3285958Z 2025-05-07T19:56:48.3287527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.3289273Z 2025-05-07T19:56:48.3290836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:48.3292628Z 2025-05-07T19:56:49.2106079Z [275/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:56:49.2129470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.2131547Z 2025-05-07T19:56:49.2133207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.2135007Z 2025-05-07T19:56:49.2136639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.2138592Z 2025-05-07T19:56:49.2140348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.2142375Z 2025-05-07T19:56:49.2144015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.2145938Z 2025-05-07T19:56:49.2147734Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.2149506Z 2025-05-07T19:56:49.7068624Z [276/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:56:49.7084931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7086282Z 2025-05-07T19:56:49.7087496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7088826Z 2025-05-07T19:56:49.7089995Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7091300Z 2025-05-07T19:56:49.7092460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7093780Z 2025-05-07T19:56:49.7094923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7096219Z 2025-05-07T19:56:49.7097400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:49.7098720Z 2025-05-07T19:56:50.0439035Z [277/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o 2025-05-07T19:56:50.0459987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0461704Z 2025-05-07T19:56:50.0463188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0464881Z 2025-05-07T19:56:50.0466346Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0468030Z 2025-05-07T19:56:50.0469459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0471141Z 2025-05-07T19:56:50.0472774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0474378Z 2025-05-07T19:56:50.0476138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:50.0477848Z 2025-05-07T19:56:51.8655274Z [278/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o 2025-05-07T19:56:51.8678296Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.8680295Z 2025-05-07T19:56:51.8681867Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.8683568Z 2025-05-07T19:56:51.8685082Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.8686804Z 2025-05-07T19:56:51.8688424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.8690314Z 2025-05-07T19:56:51.8691943Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.8694192Z 2025-05-07T19:56:51.8695962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:51.8697909Z 2025-05-07T19:56:52.3718152Z [279/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o 2025-05-07T19:56:52.3741279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3743146Z 2025-05-07T19:56:52.3744887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3746855Z 2025-05-07T19:56:52.3748560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3750420Z 2025-05-07T19:56:52.3751907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3753744Z 2025-05-07T19:56:52.3755617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3757382Z 2025-05-07T19:56:52.3759038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.3760863Z 2025-05-07T19:56:52.4834262Z [280/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:56:52.4856960Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.4858957Z 2025-05-07T19:56:52.4860691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.4862558Z 2025-05-07T19:56:52.4864076Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.4865802Z 2025-05-07T19:56:52.4867622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.4869433Z 2025-05-07T19:56:52.4871059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.4873036Z 2025-05-07T19:56:52.4874666Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:52.4876805Z 2025-05-07T19:56:53.7067868Z [281/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:56:53.7090650Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.7092248Z 2025-05-07T19:56:53.7093771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.7095658Z 2025-05-07T19:56:53.7097684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.7099587Z 2025-05-07T19:56:53.7101275Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.7103222Z 2025-05-07T19:56:53.7104810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.7106851Z 2025-05-07T19:56:53.7108419Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:53.7110171Z 2025-05-07T19:56:54.5806649Z [282/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o 2025-05-07T19:56:54.5818373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5819401Z 2025-05-07T19:56:54.5820282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5821273Z 2025-05-07T19:56:54.5822338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:54.5823262Z 2025-05-07T19:56:54.5824169Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5825134Z 2025-05-07T19:56:54.5825996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5827114Z 2025-05-07T19:56:54.5827822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_gwd_kernel.cu(231): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T19:56:54.5828641Z 2025-05-07T19:56:54.5829521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5830484Z 2025-05-07T19:56:54.5831625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:54.5832735Z 2025-05-07T19:56:56.1326449Z [283/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o 2025-05-07T19:56:56.1349563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.1351456Z 2025-05-07T19:56:56.1353213Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.1355067Z 2025-05-07T19:56:56.1357009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.1358815Z 2025-05-07T19:56:56.1360534Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.1362409Z 2025-05-07T19:56:56.1364059Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.1365932Z 2025-05-07T19:56:56.1367593Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:56.1369470Z 2025-05-07T19:56:57.4961420Z [284/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o 2025-05-07T19:56:57.4984787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.4986612Z 2025-05-07T19:56:57.4988363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.4993704Z 2025-05-07T19:56:57.4995200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:57.4996696Z 2025-05-07T19:56:57.4998131Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:57.4999816Z 2025-05-07T19:56:57.5001266Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:57.5002961Z 2025-05-07T19:56:57.5004422Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:57.5006132Z 2025-05-07T19:56:57.5007599Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:57.5009291Z 2025-05-07T19:56:57.5010730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:57.5012368Z 2025-05-07T19:56:57.5013771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:57.5015453Z 2025-05-07T19:56:57.5016787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:57.5018298Z 2025-05-07T19:56:57.5020019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.5021907Z 2025-05-07T19:56:57.5023639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.5025189Z 2025-05-07T19:56:57.5026231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(288): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:57.5027323Z 2025-05-07T19:56:57.5028342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(289): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:57.5029573Z 2025-05-07T19:56:57.5030893Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(136): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:57.5032377Z 2025-05-07T19:56:57.5033515Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(137): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:57.5034938Z 2025-05-07T19:56:57.5036219Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(774): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:57.5037957Z 2025-05-07T19:56:57.5039132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(775): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:57.5040605Z 2025-05-07T19:56:57.5041956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(627): warning #177-D: variable "error_code" was declared but never referenced 2025-05-07T19:56:57.5043563Z 2025-05-07T19:56:57.5044977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu(628): warning #177-D: variable "error_value" was declared but never referenced 2025-05-07T19:56:57.5046521Z 2025-05-07T19:56:57.5048204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.5049967Z 2025-05-07T19:56:57.5051514Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:56:57.5053428Z 2025-05-07T19:57:03.2482536Z [285/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:03.2506536Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.2508733Z 2025-05-07T19:57:03.2510524Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.2512652Z 2025-05-07T19:57:03.2514434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.2516450Z 2025-05-07T19:57:03.2518143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.2520086Z 2025-05-07T19:57:03.2521771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.2523714Z 2025-05-07T19:57:03.2525473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:03.2527450Z 2025-05-07T19:57:04.4143537Z [286/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:04.4168014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.4169984Z 2025-05-07T19:57:04.4171694Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.4173650Z 2025-05-07T19:57:04.4175436Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.4177456Z 2025-05-07T19:57:04.4179238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.4181262Z 2025-05-07T19:57:04.4182945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.4184958Z 2025-05-07T19:57:04.4186684Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.4188422Z 2025-05-07T19:57:04.5814606Z [287/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:04.5839403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.5841426Z 2025-05-07T19:57:04.5843193Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.5845096Z 2025-05-07T19:57:04.5846880Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.5848812Z 2025-05-07T19:57:04.5850556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.5852558Z 2025-05-07T19:57:04.5854286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.5856221Z 2025-05-07T19:57:04.5857914Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:04.5859936Z 2025-05-07T19:57:05.7493462Z [288/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o 2025-05-07T19:57:05.7515813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:05.7517619Z 2025-05-07T19:57:05.7519088Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:05.7520923Z 2025-05-07T19:57:05.7522386Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:05.7524090Z 2025-05-07T19:57:05.7525561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:05.7527250Z 2025-05-07T19:57:05.7528658Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:05.7530320Z 2025-05-07T19:57:05.7532027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:05.7533708Z 2025-05-07T19:57:05.7535272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:05.7537017Z 2025-05-07T19:57:05.7538598Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:05.7540377Z 2025-05-07T19:57:05.7541834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:05.7543510Z 2025-05-07T19:57:05.7545319Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:05.7547020Z 2025-05-07T19:57:05.7548438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:05.7550095Z 2025-05-07T19:57:05.7551584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:05.7553302Z 2025-05-07T19:57:05.7555120Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:05.7556925Z 2025-05-07T19:57:05.7558566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:05.7560423Z 2025-05-07T19:57:05.7561833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(270): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:05.7563473Z 2025-05-07T19:57:05.7564968Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(275): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:05.7566651Z 2025-05-07T19:57:05.7567980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(292): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:05.7569673Z 2025-05-07T19:57:05.7571194Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu(304): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:05.7572868Z 2025-05-07T19:57:10.0254617Z [289/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:10.0276981Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0278564Z 2025-05-07T19:57:10.0279874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0281644Z 2025-05-07T19:57:10.0283117Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0284885Z 2025-05-07T19:57:10.0286527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0288315Z 2025-05-07T19:57:10.0289814Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0291373Z 2025-05-07T19:57:10.0292787Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:10.0294517Z 2025-05-07T19:57:14.0296366Z [290/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_indice_weights_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o 2025-05-07T19:57:14.0320223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.0321991Z 2025-05-07T19:57:14.0323695Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.0325668Z 2025-05-07T19:57:14.0327307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.0329300Z 2025-05-07T19:57:14.0331042Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.0333198Z 2025-05-07T19:57:14.0334873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:14.0336734Z 2025-05-07T19:57:15.1030858Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.1033332Z 2025-05-07T19:57:15.1055537Z [291/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_inference_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o 2025-05-07T19:57:15.1079822Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.1081761Z 2025-05-07T19:57:15.1083512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.1085524Z 2025-05-07T19:57:15.1087121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:15.1088933Z 2025-05-07T19:57:15.1090525Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:15.1092364Z 2025-05-07T19:57:15.1094021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:15.1095755Z 2025-05-07T19:57:15.1097195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:15.1098814Z 2025-05-07T19:57:15.1100358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.1101974Z 2025-05-07T19:57:15.1103366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.1104933Z 2025-05-07T19:57:15.1106271Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:15.1107856Z 2025-05-07T19:57:15.1109348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:15.1111039Z 2025-05-07T19:57:15.1113020Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:15.1114749Z 2025-05-07T19:57:15.1116231Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:15.1117891Z 2025-05-07T19:57:15.1119634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.1121784Z 2025-05-07T19:57:15.1123571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:15.1125506Z 2025-05-07T19:57:15.1127180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(254): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:15.1129079Z 2025-05-07T19:57:15.1130745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(259): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:15.1132920Z 2025-05-07T19:57:15.1134578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(276): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:15.1136452Z 2025-05-07T19:57:15.1138127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu(288): warning #186-D: pointless comparison of unsigned integer with zero 2025-05-07T19:57:15.1140033Z 2025-05-07T19:57:16.1283059Z [292/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_inference.so -o fbgemm_gpu_tbe_inference.so CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_unweighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_weighted_codegen_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/codegen/inference/embedding_forward_quantized_split_lookup.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_weighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_nobag_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp32_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp16_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_fp8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int8_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int4_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_kernel_unweighted_int2_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_nobag_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_inference.dir/gen_embedding_forward_quantized_split_nbit_host_unweighted_codegen_cuda.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_config.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed asmjit.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T19:57:19.2336012Z [293/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:19.2347930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.2348918Z 2025-05-07T19:57:19.2349786Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.2350773Z 2025-05-07T19:57:19.2351627Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.2352730Z 2025-05-07T19:57:19.2353614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.2354600Z 2025-05-07T19:57:19.2355459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.2356440Z 2025-05-07T19:57:19.2357300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:19.2358286Z 2025-05-07T19:57:20.5749046Z [294/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:20.5772811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.5774726Z 2025-05-07T19:57:20.5776385Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.5778382Z 2025-05-07T19:57:20.5780141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.5782031Z 2025-05-07T19:57:20.5783766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.5785669Z 2025-05-07T19:57:20.5787336Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.5789243Z 2025-05-07T19:57:20.5790941Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:20.5792978Z 2025-05-07T19:57:22.2441608Z [295/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:22.2462496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.2464271Z 2025-05-07T19:57:22.2465808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.2467339Z 2025-05-07T19:57:22.2468756Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.2470406Z 2025-05-07T19:57:22.2471976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.2473996Z 2025-05-07T19:57:22.2480343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.2482246Z 2025-05-07T19:57:22.2483931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.2485815Z 2025-05-07T19:57:22.5085751Z [296/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:22.5106264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5108113Z 2025-05-07T19:57:22.5109375Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5111091Z 2025-05-07T19:57:22.5112557Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5114121Z 2025-05-07T19:57:22.5115486Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5120127Z 2025-05-07T19:57:22.5121731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5123534Z 2025-05-07T19:57:22.5125489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:22.5127320Z 2025-05-07T19:57:24.1996396Z [297/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:24.2018236Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.2020096Z 2025-05-07T19:57:24.2021634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.2023436Z 2025-05-07T19:57:24.2024997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.2026997Z 2025-05-07T19:57:24.2028631Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.2030311Z 2025-05-07T19:57:24.2032057Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.2033777Z 2025-05-07T19:57:24.2035558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:24.2037265Z 2025-05-07T19:57:25.7988749Z [298/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:25.8010963Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.8012845Z 2025-05-07T19:57:25.8014548Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.8016306Z 2025-05-07T19:57:25.8017939Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.8020078Z 2025-05-07T19:57:25.8021717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.8023533Z 2025-05-07T19:57:25.8025451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.8027260Z 2025-05-07T19:57:25.8028886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:25.8030645Z 2025-05-07T19:57:30.4383276Z [299/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:30.4403935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.4405622Z 2025-05-07T19:57:30.4407114Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.4409009Z 2025-05-07T19:57:30.4410483Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.4412159Z 2025-05-07T19:57:30.4413903Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.4415563Z 2025-05-07T19:57:30.4417054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.4418757Z 2025-05-07T19:57:30.4420259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.4422020Z 2025-05-07T19:57:30.6255746Z [300/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:30.6276338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.6278545Z 2025-05-07T19:57:30.6280040Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.6281817Z 2025-05-07T19:57:30.6283272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.6284922Z 2025-05-07T19:57:30.6286752Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.6288559Z 2025-05-07T19:57:30.6290081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.6291799Z 2025-05-07T19:57:30.6293374Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:30.6295285Z 2025-05-07T19:57:31.7951777Z [301/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:31.7973024Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.7975079Z 2025-05-07T19:57:31.7976600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.7978274Z 2025-05-07T19:57:31.7980106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.7981887Z 2025-05-07T19:57:31.7983429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.7985129Z 2025-05-07T19:57:31.7986608Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.7988483Z 2025-05-07T19:57:31.7989991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.7991746Z 2025-05-07T19:57:31.8383610Z [302/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:31.8406932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.8408646Z 2025-05-07T19:57:31.8410478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.8412010Z 2025-05-07T19:57:31.8413214Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.8414888Z 2025-05-07T19:57:31.8416433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.8418363Z 2025-05-07T19:57:31.8419715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.8421204Z 2025-05-07T19:57:31.8422639Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:31.8424319Z 2025-05-07T19:57:32.4950122Z [303/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o 2025-05-07T19:57:32.4970512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.4972308Z 2025-05-07T19:57:32.4974071Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.4975897Z 2025-05-07T19:57:32.4977451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.4978984Z 2025-05-07T19:57:32.4980418Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.4982567Z 2025-05-07T19:57:32.4983954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.4985767Z 2025-05-07T19:57:32.4987220Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.4988921Z 2025-05-07T19:57:32.5335626Z [304/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:32.5360351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.5362208Z 2025-05-07T19:57:32.5363712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.5365605Z 2025-05-07T19:57:32.5367262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.5369253Z 2025-05-07T19:57:32.5370715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.5372565Z 2025-05-07T19:57:32.5374070Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.5375823Z 2025-05-07T19:57:32.5377414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.5379040Z 2025-05-07T19:57:32.7197111Z [305/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:32.7218363Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.7220251Z 2025-05-07T19:57:32.7221751Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.7223492Z 2025-05-07T19:57:32.7225264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.7226805Z 2025-05-07T19:57:32.7228252Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.7230038Z 2025-05-07T19:57:32.7231860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.7233678Z 2025-05-07T19:57:32.7235177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:32.7236890Z 2025-05-07T19:57:34.9078928Z [306/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:34.9101293Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:34.9102950Z 2025-05-07T19:57:34.9104435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:34.9106280Z 2025-05-07T19:57:34.9107755Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:34.9109429Z 2025-05-07T19:57:34.9110797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:34.9112460Z 2025-05-07T19:57:34.9113982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:34.9115665Z 2025-05-07T19:57:34.9117027Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:34.9118565Z 2025-05-07T19:57:35.2483256Z [307/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:57:35.2498840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.2500268Z 2025-05-07T19:57:35.2501393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.2502652Z 2025-05-07T19:57:35.2503735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.2504969Z 2025-05-07T19:57:35.2506050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.2507317Z 2025-05-07T19:57:35.2508396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.2509626Z 2025-05-07T19:57:35.2510748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.2511972Z 2025-05-07T19:57:35.3791294Z [308/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o 2025-05-07T19:57:35.3808571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3810183Z 2025-05-07T19:57:35.3811558Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3813037Z 2025-05-07T19:57:35.3814324Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3815764Z 2025-05-07T19:57:35.3817065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3818546Z 2025-05-07T19:57:35.3819809Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3821163Z 2025-05-07T19:57:35.3822342Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.3823653Z 2025-05-07T19:57:35.5926126Z [309/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:35.5945009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.5946510Z 2025-05-07T19:57:35.5947805Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.5949248Z 2025-05-07T19:57:35.5950546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.5952005Z 2025-05-07T19:57:35.5953405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.5954883Z 2025-05-07T19:57:35.5956184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.5957647Z 2025-05-07T19:57:35.5958953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:35.5960434Z 2025-05-07T19:57:36.1734931Z [310/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o 2025-05-07T19:57:36.1751523Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.1753020Z 2025-05-07T19:57:36.1754243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.1755629Z 2025-05-07T19:57:36.1756849Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.1758219Z 2025-05-07T19:57:36.1759459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.1760830Z 2025-05-07T19:57:36.1762034Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.1763413Z 2025-05-07T19:57:36.1764629Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.1766006Z 2025-05-07T19:57:36.2576664Z [311/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:36.2593392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.2594798Z 2025-05-07T19:57:36.2596184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.2597578Z 2025-05-07T19:57:36.2598800Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.2600166Z 2025-05-07T19:57:36.2601413Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.2602787Z 2025-05-07T19:57:36.2603997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.2605376Z 2025-05-07T19:57:36.2606596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.2607979Z 2025-05-07T19:57:36.9293876Z [312/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o 2025-05-07T19:57:36.9318006Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.9319929Z 2025-05-07T19:57:36.9321623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.9323604Z 2025-05-07T19:57:36.9325391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.9327175Z 2025-05-07T19:57:36.9328812Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.9330739Z 2025-05-07T19:57:36.9332937Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.9334892Z 2025-05-07T19:57:36.9336655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:36.9338631Z 2025-05-07T19:57:38.2285608Z [313/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o 2025-05-07T19:57:38.2307262Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.2309114Z 2025-05-07T19:57:38.2310747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.2312913Z 2025-05-07T19:57:38.2314709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.2316671Z 2025-05-07T19:57:38.2318433Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.2320137Z 2025-05-07T19:57:38.2321632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.2323321Z 2025-05-07T19:57:38.2324840Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.2326603Z 2025-05-07T19:57:38.2830188Z [314/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:57:38.2853468Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.2855275Z 2025-05-07T19:57:38.2856900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.2858688Z 2025-05-07T19:57:38.2860263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.2861990Z 2025-05-07T19:57:38.2863729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.2865760Z 2025-05-07T19:57:38.2867537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.2869512Z 2025-05-07T19:57:38.2871164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:38.2873269Z 2025-05-07T19:57:39.1682775Z [315/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o 2025-05-07T19:57:39.1704518Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.1706324Z 2025-05-07T19:57:39.1708074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.1709842Z 2025-05-07T19:57:39.1711383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.1713314Z 2025-05-07T19:57:39.1714934Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.1716744Z 2025-05-07T19:57:39.1718445Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.1720251Z 2025-05-07T19:57:39.1721850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:39.1723873Z 2025-05-07T19:57:40.0862929Z [316/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:40.0880380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.0881803Z 2025-05-07T19:57:40.0883092Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.0884544Z 2025-05-07T19:57:40.0885810Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.0887198Z 2025-05-07T19:57:40.0888519Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.0889941Z 2025-05-07T19:57:40.0891200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.0892823Z 2025-05-07T19:57:40.0894066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:40.0895499Z 2025-05-07T19:57:41.5975648Z [317/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o 2025-05-07T19:57:41.5998567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.6000663Z 2025-05-07T19:57:41.6002414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.6004400Z 2025-05-07T19:57:41.6006222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.6008249Z 2025-05-07T19:57:41.6009923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.6011832Z 2025-05-07T19:57:41.6013584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.6015777Z 2025-05-07T19:57:41.6017492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:41.6019438Z 2025-05-07T19:57:43.7446941Z [318/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:43.7467656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7469407Z 2025-05-07T19:57:43.7470946Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7472798Z 2025-05-07T19:57:43.7474295Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7475976Z 2025-05-07T19:57:43.7477448Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7479396Z 2025-05-07T19:57:43.7480836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7482537Z 2025-05-07T19:57:43.7484216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:43.7485962Z 2025-05-07T19:57:45.0445660Z [319/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:57:45.0463970Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.0465537Z 2025-05-07T19:57:45.0466857Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.0468318Z 2025-05-07T19:57:45.0469623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.0471297Z 2025-05-07T19:57:45.0472724Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.0474210Z 2025-05-07T19:57:45.0475494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.0476860Z 2025-05-07T19:57:45.0478424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:45.0480077Z 2025-05-07T19:57:46.3952046Z [320/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:57:46.3973371Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3975179Z 2025-05-07T19:57:46.3976881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3978617Z 2025-05-07T19:57:46.3980056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3982072Z 2025-05-07T19:57:46.3983601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3985374Z 2025-05-07T19:57:46.3987207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3988981Z 2025-05-07T19:57:46.3990560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:46.3992379Z 2025-05-07T19:57:47.5145085Z [321/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o 2025-05-07T19:57:47.5165712Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:47.5167374Z 2025-05-07T19:57:47.5168798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:47.5170756Z 2025-05-07T19:57:47.5172309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:47.5174001Z 2025-05-07T19:57:47.5175542Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:47.5177309Z 2025-05-07T19:57:47.5179056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:47.5180770Z 2025-05-07T19:57:47.5182314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:47.5184065Z 2025-05-07T19:57:49.4688840Z [322/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o 2025-05-07T19:57:49.4709500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.4711324Z 2025-05-07T19:57:49.4713044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.4715036Z 2025-05-07T19:57:49.4716482Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.4717939Z 2025-05-07T19:57:49.4719470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.4721102Z 2025-05-07T19:57:49.4722642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.4724329Z 2025-05-07T19:57:49.4725835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:49.4727744Z 2025-05-07T19:57:51.9786901Z [323/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o 2025-05-07T19:57:51.9809414Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.9811628Z 2025-05-07T19:57:51.9813343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.9815234Z 2025-05-07T19:57:51.9816817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.9818761Z 2025-05-07T19:57:51.9820467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.9822402Z 2025-05-07T19:57:51.9824106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.9825900Z 2025-05-07T19:57:51.9827395Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:51.9829208Z 2025-05-07T19:57:52.9925378Z [324/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:57:52.9948143Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.9950019Z 2025-05-07T19:57:52.9951649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.9953617Z 2025-05-07T19:57:52.9955565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.9957463Z 2025-05-07T19:57:52.9959223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.9961235Z 2025-05-07T19:57:52.9962895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.9964916Z 2025-05-07T19:57:52.9966630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:52.9968366Z 2025-05-07T19:57:59.5481025Z [325/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o 2025-05-07T19:57:59.5503747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.5505319Z 2025-05-07T19:57:59.5507122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.5508910Z 2025-05-07T19:57:59.5510406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.5512132Z 2025-05-07T19:57:59.5513896Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.5515859Z 2025-05-07T19:57:59.5517478Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.5519348Z 2025-05-07T19:57:59.5521056Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:57:59.5522954Z 2025-05-07T19:58:24.6250903Z [326/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:24.6274229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.6276162Z 2025-05-07T19:58:24.6278426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.6280362Z 2025-05-07T19:58:24.6282064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.6284016Z 2025-05-07T19:58:24.6285743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.6287808Z 2025-05-07T19:58:24.6289560Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.6291485Z 2025-05-07T19:58:24.6293210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:24.6295187Z 2025-05-07T19:58:25.2152701Z [327/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:25.2191731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.2193332Z 2025-05-07T19:58:25.2194580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.2195970Z 2025-05-07T19:58:25.2197192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.2198716Z 2025-05-07T19:58:25.2199942Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.2201360Z 2025-05-07T19:58:25.2202584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.2203943Z 2025-05-07T19:58:25.2205200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:25.2206607Z 2025-05-07T19:58:26.1193957Z [328/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:26.1216279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.1218039Z 2025-05-07T19:58:26.1219549Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.1221306Z 2025-05-07T19:58:26.1223171Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.1224878Z 2025-05-07T19:58:26.1226477Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.1228254Z 2025-05-07T19:58:26.1229830Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.1231987Z 2025-05-07T19:58:26.1233618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.1235382Z 2025-05-07T19:58:26.4722837Z [329/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o 2025-05-07T19:58:26.4740546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.4741972Z 2025-05-07T19:58:26.4743190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.4744769Z 2025-05-07T19:58:26.4746004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.4747384Z 2025-05-07T19:58:26.4748618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.4749999Z 2025-05-07T19:58:26.4751227Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.4753049Z 2025-05-07T19:58:26.4754678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:26.4756499Z 2025-05-07T19:58:38.8906588Z [330/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:38.8928018Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.8930072Z 2025-05-07T19:58:38.8931855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.8933424Z 2025-05-07T19:58:38.8934956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.8936706Z 2025-05-07T19:58:38.8938196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.8939851Z 2025-05-07T19:58:38.8941343Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.8943019Z 2025-05-07T19:58:38.8944596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:38.8946385Z 2025-05-07T19:58:39.5870009Z [331/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o 2025-05-07T19:58:39.5893202Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.5895277Z 2025-05-07T19:58:39.5896842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.5898712Z 2025-05-07T19:58:39.5900475Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.5902423Z 2025-05-07T19:58:39.5904165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.5906105Z 2025-05-07T19:58:39.5907784Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.5909712Z 2025-05-07T19:58:39.5911434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.5913539Z 2025-05-07T19:58:39.7456381Z [332/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o 2025-05-07T19:58:39.7479744Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.7481621Z 2025-05-07T19:58:39.7483237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.7484965Z 2025-05-07T19:58:39.7486471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.7488375Z 2025-05-07T19:58:39.7490093Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.7492038Z 2025-05-07T19:58:39.7493770Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.7495690Z 2025-05-07T19:58:39.7497421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:39.7499360Z 2025-05-07T19:58:40.1367779Z [333/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o 2025-05-07T19:58:40.1389322Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.1391003Z 2025-05-07T19:58:40.1392435Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.1394237Z 2025-05-07T19:58:40.1395676Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.1397320Z 2025-05-07T19:58:40.1398905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.1400561Z 2025-05-07T19:58:40.1402053Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.1403708Z 2025-05-07T19:58:40.1405276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.1406830Z 2025-05-07T19:58:40.9718058Z [334/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o 2025-05-07T19:58:40.9741430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.9743373Z 2025-05-07T19:58:40.9745043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.9746909Z 2025-05-07T19:58:40.9748591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.9750421Z 2025-05-07T19:58:40.9751962Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.9753936Z 2025-05-07T19:58:40.9755569Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.9757337Z 2025-05-07T19:58:40.9758979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:40.9760788Z 2025-05-07T19:58:41.1449585Z [335/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:58:41.1472096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:41.1474117Z 2025-05-07T19:58:41.1475703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:41.1477586Z 2025-05-07T19:58:41.1479245Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:41.1481147Z 2025-05-07T19:58:41.1482836Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:41.1484505Z 2025-05-07T19:58:41.1485877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:41.1487441Z 2025-05-07T19:58:41.1488881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:41.1490572Z 2025-05-07T19:58:42.4715162Z [336/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:42.4735572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.4737329Z 2025-05-07T19:58:42.4738839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.4740534Z 2025-05-07T19:58:42.4741964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.4743627Z 2025-05-07T19:58:42.4745089Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.4746840Z 2025-05-07T19:58:42.4748270Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.4749881Z 2025-05-07T19:58:42.4751360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:42.4753171Z 2025-05-07T19:58:44.2593671Z [337/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o 2025-05-07T19:58:44.2616692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.2618634Z 2025-05-07T19:58:44.2620380Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.2622423Z 2025-05-07T19:58:44.2624181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.2626160Z 2025-05-07T19:58:44.2627813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.2629783Z 2025-05-07T19:58:44.2631817Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.2633745Z 2025-05-07T19:58:44.2635470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.2637735Z 2025-05-07T19:58:44.4967117Z [338/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_weighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o 2025-05-07T19:58:44.4990438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.4992456Z 2025-05-07T19:58:44.4994338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.4996284Z 2025-05-07T19:58:44.4997877Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.4999767Z 2025-05-07T19:58:44.5001544Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.5003462Z 2025-05-07T19:58:44.5005184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.5007025Z 2025-05-07T19:58:44.5008731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:44.5011027Z 2025-05-07T19:58:47.0007337Z [339/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o 2025-05-07T19:58:47.0029206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.0030938Z 2025-05-07T19:58:47.0032964Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.0034800Z 2025-05-07T19:58:47.0036559Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.0038478Z 2025-05-07T19:58:47.0040272Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.0042073Z 2025-05-07T19:58:47.0043471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.0045443Z 2025-05-07T19:58:47.0047050Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.0048856Z 2025-05-07T19:58:47.0953695Z [340/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o 2025-05-07T19:58:47.0975956Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.0977794Z 2025-05-07T19:58:47.0979404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.0981234Z 2025-05-07T19:58:47.0982738Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.0984450Z 2025-05-07T19:58:47.0986086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.0988005Z 2025-05-07T19:58:47.0989708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.0992041Z 2025-05-07T19:58:47.0993945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:47.0995536Z 2025-05-07T19:58:48.2216265Z [341/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp 2025-05-07T19:58:48.2236641Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:48.7884168Z [342/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o 2025-05-07T19:58:48.7901910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.7903330Z 2025-05-07T19:58:48.7904602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.7906175Z 2025-05-07T19:58:48.7907580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.7909157Z 2025-05-07T19:58:48.7910521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.7912049Z 2025-05-07T19:58:48.7913561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.7915053Z 2025-05-07T19:58:48.7916411Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:48.7917936Z 2025-05-07T19:58:48.8047050Z [343/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp 2025-05-07T19:58:48.8062774Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:49.4554668Z [344/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:49.4573874Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:49.7361238Z [345/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp 2025-05-07T19:58:49.7379753Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:50.7460884Z [346/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:58:50.7483485Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.7484978Z 2025-05-07T19:58:50.7486318Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.7488169Z 2025-05-07T19:58:50.7489730Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.7491516Z 2025-05-07T19:58:50.7493081Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.7494848Z 2025-05-07T19:58:50.7496403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.7498174Z 2025-05-07T19:58:50.7499638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:50.7501436Z 2025-05-07T19:58:51.1870093Z [347/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp 2025-05-07T19:58:51.1892757Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:52.6633574Z [348/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp 2025-05-07T19:58:52.6654050Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:52.8054685Z [349/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o 2025-05-07T19:58:52.8075944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8077909Z 2025-05-07T19:58:52.8079443Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8081184Z 2025-05-07T19:58:52.8082882Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8084704Z 2025-05-07T19:58:52.8086188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8087851Z 2025-05-07T19:58:52.8089307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8091115Z 2025-05-07T19:58:52.8092691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:52.8094360Z 2025-05-07T19:58:53.0397219Z [350/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:53.0417435Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:55.5533988Z [351/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:55.5554029Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:56.9511006Z [352/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:56.9527316Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:57.1355108Z [353/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp 2025-05-07T19:58:57.1375077Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:57.1507332Z [354/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp 2025-05-07T19:58:57.1526258Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:57.2059376Z [355/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp 2025-05-07T19:58:57.2078636Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T19:58:57.3522678Z [356/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o 2025-05-07T19:58:57.3543835Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.3545613Z 2025-05-07T19:58:57.3547139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.3548855Z 2025-05-07T19:58:57.3550340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.3552270Z 2025-05-07T19:58:57.3553931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.3555675Z 2025-05-07T19:58:57.3557333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.3558993Z 2025-05-07T19:58:57.3560574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:57.3562384Z 2025-05-07T19:58:58.0573059Z [357/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:58:58.0596771Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.0598713Z 2025-05-07T19:58:58.0600406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.0602385Z 2025-05-07T19:58:58.0604000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.0605779Z 2025-05-07T19:58:58.0607691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.0609555Z 2025-05-07T19:58:58.0611078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.0612832Z 2025-05-07T19:58:58.0614424Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:58:58.0616335Z 2025-05-07T19:59:00.1497024Z [358/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o 2025-05-07T19:59:00.1520258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:00.1522666Z 2025-05-07T19:59:00.1524358Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:00.1525999Z 2025-05-07T19:59:00.1527796Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:00.1529430Z 2025-05-07T19:59:00.1530808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:00.1532655Z 2025-05-07T19:59:00.1533574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:00.1534704Z 2025-05-07T19:59:00.1535578Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:00.1536576Z 2025-05-07T19:59:03.9704200Z [359/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o 2025-05-07T19:59:03.9726925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.9728347Z 2025-05-07T19:59:03.9730206Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.9732345Z 2025-05-07T19:59:03.9733976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.9735948Z 2025-05-07T19:59:03.9737708Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.9739669Z 2025-05-07T19:59:03.9741211Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.9742885Z 2025-05-07T19:59:03.9744457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:03.9746178Z 2025-05-07T19:59:05.0239129Z [360/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o 2025-05-07T19:59:05.0259405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0261593Z 2025-05-07T19:59:05.0263139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0264757Z 2025-05-07T19:59:05.0266140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0267714Z 2025-05-07T19:59:05.0269147Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0271192Z 2025-05-07T19:59:05.0272928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0274634Z 2025-05-07T19:59:05.0276172Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:05.0278010Z 2025-05-07T19:59:06.0341951Z [361/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:06.0365898Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0367908Z 2025-05-07T19:59:06.0369697Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0371667Z 2025-05-07T19:59:06.0373287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0375296Z 2025-05-07T19:59:06.0376788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0378475Z 2025-05-07T19:59:06.0379924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0381452Z 2025-05-07T19:59:06.0382881Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:06.0384594Z 2025-05-07T19:59:09.5056463Z [362/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o 2025-05-07T19:59:09.5079098Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5080921Z 2025-05-07T19:59:09.5082566Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5084705Z 2025-05-07T19:59:09.5086104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5087812Z 2025-05-07T19:59:09.5089469Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5091438Z 2025-05-07T19:59:09.5092931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5094744Z 2025-05-07T19:59:09.5096263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.5098135Z 2025-05-07T19:59:09.6386467Z [363/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:09.6409874Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6412024Z 2025-05-07T19:59:09.6413661Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6415560Z 2025-05-07T19:59:09.6417140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6419114Z 2025-05-07T19:59:09.6420799Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6422798Z 2025-05-07T19:59:09.6424513Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6426418Z 2025-05-07T19:59:09.6428164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:09.6429737Z 2025-05-07T19:59:10.0306665Z [364/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o 2025-05-07T19:59:10.0328225Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.0329935Z 2025-05-07T19:59:10.0331777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.0333502Z 2025-05-07T19:59:10.0334985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.0336723Z 2025-05-07T19:59:10.0338276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.0339896Z 2025-05-07T19:59:10.0341451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.0342954Z 2025-05-07T19:59:10.0344487Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.0346279Z 2025-05-07T19:59:10.6708684Z [365/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o 2025-05-07T19:59:10.6733348Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.6735367Z 2025-05-07T19:59:10.6737139Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.6739126Z 2025-05-07T19:59:10.6740847Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.6742790Z 2025-05-07T19:59:10.6744428Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.6746318Z 2025-05-07T19:59:10.6748036Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.6749884Z 2025-05-07T19:59:10.6751624Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:10.6753678Z 2025-05-07T19:59:11.8925264Z [366/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o 2025-05-07T19:59:11.8946321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8948003Z 2025-05-07T19:59:11.8949421Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8951043Z 2025-05-07T19:59:11.8952403Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8954085Z 2025-05-07T19:59:11.8955552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8957110Z 2025-05-07T19:59:11.8958511Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8960081Z 2025-05-07T19:59:11.8961467Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:11.8963093Z 2025-05-07T19:59:15.2617220Z [367/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:15.2639264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2641092Z 2025-05-07T19:59:15.2642769Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2644538Z 2025-05-07T19:59:15.2646107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2647969Z 2025-05-07T19:59:15.2649476Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2651241Z 2025-05-07T19:59:15.2652677Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2654561Z 2025-05-07T19:59:15.2656119Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.2657896Z 2025-05-07T19:59:15.4564780Z [368/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o 2025-05-07T19:59:15.4587222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.4589048Z 2025-05-07T19:59:15.4590669Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.4592482Z 2025-05-07T19:59:15.4594167Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.4596010Z 2025-05-07T19:59:15.4597561Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.4599375Z 2025-05-07T19:59:15.4600984Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.4602760Z 2025-05-07T19:59:15.4604376Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:15.4606172Z 2025-05-07T19:59:16.7978096Z [369/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T19:59:16.8001122Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.8002995Z 2025-05-07T19:59:16.8004488Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.8006345Z 2025-05-07T19:59:16.8007821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.8009618Z 2025-05-07T19:59:16.8011163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.8012960Z 2025-05-07T19:59:16.8014481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.8016561Z 2025-05-07T19:59:16.8018180Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:16.8019960Z 2025-05-07T19:59:19.1902216Z [370/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o 2025-05-07T19:59:19.1932692Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1935057Z 2025-05-07T19:59:19.1937239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1939695Z 2025-05-07T19:59:19.1941842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1944269Z 2025-05-07T19:59:19.1946442Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1949044Z 2025-05-07T19:59:19.1951048Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1953479Z 2025-05-07T19:59:19.1955471Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.1957765Z 2025-05-07T19:59:19.2925238Z [371/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o 2025-05-07T19:59:19.2955571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.2958070Z 2025-05-07T19:59:19.2960259Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.2962592Z 2025-05-07T19:59:19.2964596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.2966930Z 2025-05-07T19:59:19.2969208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.2971516Z 2025-05-07T19:59:19.2973546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.2975823Z 2025-05-07T19:59:19.2978188Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:19.2980460Z 2025-05-07T19:59:22.1521605Z [372/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o 2025-05-07T19:59:22.1543107Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.1544892Z 2025-05-07T19:59:22.1546490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.1548235Z 2025-05-07T19:59:22.1549719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.1551730Z 2025-05-07T19:59:22.1553350Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.1555043Z 2025-05-07T19:59:22.1556869Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.1558571Z 2025-05-07T19:59:22.1560099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:22.1561841Z 2025-05-07T19:59:28.1310615Z [373/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o 2025-05-07T19:59:28.1333520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:28.1335348Z 2025-05-07T19:59:28.1336945Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:28.1338996Z 2025-05-07T19:59:28.1340580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:28.1342361Z 2025-05-07T19:59:28.1344185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:28.1345935Z 2025-05-07T19:59:28.1347481Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:28.1349194Z 2025-05-07T19:59:28.1350728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:28.1352585Z 2025-05-07T19:59:31.7229630Z [374/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o 2025-05-07T19:59:31.7251966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.7254298Z 2025-05-07T19:59:31.7255587Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.7257298Z 2025-05-07T19:59:31.7258949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.7260521Z 2025-05-07T19:59:31.7262061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.7263760Z 2025-05-07T19:59:31.7265150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.7266810Z 2025-05-07T19:59:31.7268314Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:31.7270034Z 2025-05-07T19:59:34.4602048Z [375/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o 2025-05-07T19:59:34.4624891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:34.4626787Z 2025-05-07T19:59:34.4628185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:34.4630389Z 2025-05-07T19:59:34.4632153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:34.4634043Z 2025-05-07T19:59:34.4635571Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:34.4637482Z 2025-05-07T19:59:34.4639192Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:34.4640910Z 2025-05-07T19:59:34.4642394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:34.4643833Z 2025-05-07T19:59:37.5898505Z [376/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T19:59:37.5921782Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.5923534Z 2025-05-07T19:59:37.5925072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.5926838Z 2025-05-07T19:59:37.5928360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.5930099Z 2025-05-07T19:59:37.5932146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.5933851Z 2025-05-07T19:59:37.5935396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.5937100Z 2025-05-07T19:59:37.5938596Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:37.5940349Z 2025-05-07T19:59:39.6318427Z [377/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o 2025-05-07T19:59:39.6340837Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.6342637Z 2025-05-07T19:59:39.6344207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.6346020Z 2025-05-07T19:59:39.6347891Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.6349649Z 2025-05-07T19:59:39.6351177Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.6353083Z 2025-05-07T19:59:39.6354604Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.6356385Z 2025-05-07T19:59:39.6357972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:39.6359736Z 2025-05-07T19:59:45.3642363Z [378/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o 2025-05-07T19:59:45.3659630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.3661064Z 2025-05-07T19:59:45.3662297Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.3663919Z 2025-05-07T19:59:45.3665135Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.3666488Z 2025-05-07T19:59:45.3667689Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.3669073Z 2025-05-07T19:59:45.3670263Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.3671619Z 2025-05-07T19:59:45.3672979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.3674343Z 2025-05-07T19:59:45.6044701Z [379/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o 2025-05-07T19:59:45.6061820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6063420Z 2025-05-07T19:59:45.6064642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6066020Z 2025-05-07T19:59:45.6067203Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6068559Z 2025-05-07T19:59:45.6069772Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6071139Z 2025-05-07T19:59:45.6072340Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6073803Z 2025-05-07T19:59:45.6075026Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:45.6076395Z 2025-05-07T19:59:47.2364424Z [380/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T19:59:47.2389932Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:47.2391831Z 2025-05-07T19:59:47.2393575Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:47.2395607Z 2025-05-07T19:59:47.2397405Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:47.2399417Z 2025-05-07T19:59:47.2401201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:47.2403241Z 2025-05-07T19:59:47.2405030Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:47.2407042Z 2025-05-07T19:59:47.2408673Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T19:59:47.2410552Z 2025-05-07T19:59:55.4028583Z [381/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_meta.cpp 2025-05-07T19:59:55.4049055Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:00.2120593Z [382/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o 2025-05-07T20:00:00.2144602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.2146838Z 2025-05-07T20:00:00.2148581Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.2150627Z 2025-05-07T20:00:00.2152563Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.2154612Z 2025-05-07T20:00:00.2156254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.2158205Z 2025-05-07T20:00:00.2159933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.2162002Z 2025-05-07T20:00:00.2163888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:00.2165823Z 2025-05-07T20:00:03.6229980Z [383/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp 2025-05-07T20:00:03.6250870Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:06.6284074Z [384/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:00:06.6307200Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.6309023Z 2025-05-07T20:00:06.6310465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.6312344Z 2025-05-07T20:00:06.6314150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.6316082Z 2025-05-07T20:00:06.6317777Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.6319708Z 2025-05-07T20:00:06.6321396Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.6323279Z 2025-05-07T20:00:06.6324982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.6327125Z 2025-05-07T20:00:06.7800387Z [385/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:06.7822996Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.7824829Z 2025-05-07T20:00:06.7826457Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.7828346Z 2025-05-07T20:00:06.7829919Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.7831965Z 2025-05-07T20:00:06.7833630Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.7835405Z 2025-05-07T20:00:06.7837061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.7839239Z 2025-05-07T20:00:06.7840928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:06.7842710Z 2025-05-07T20:00:07.7274776Z [386/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:07.7299014Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.7300965Z 2025-05-07T20:00:07.7302656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.7304569Z 2025-05-07T20:00:07.7306222Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.7308089Z 2025-05-07T20:00:07.7309715Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.7311699Z 2025-05-07T20:00:07.7313121Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.7314603Z 2025-05-07T20:00:07.7316046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:07.7317612Z 2025-05-07T20:00:08.7009866Z [387/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_meta.cpp 2025-05-07T20:00:08.7028407Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:08.7421151Z [388/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:08.7441394Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.7442880Z 2025-05-07T20:00:08.7444181Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.7445682Z 2025-05-07T20:00:08.7446965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.7448420Z 2025-05-07T20:00:08.7449726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.7451173Z 2025-05-07T20:00:08.7452461Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.7454016Z 2025-05-07T20:00:08.7455362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:08.7456951Z 2025-05-07T20:00:10.1152743Z [389/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:10.1173855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.1175674Z 2025-05-07T20:00:10.1177195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.1178984Z 2025-05-07T20:00:10.1180533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.1182258Z 2025-05-07T20:00:10.1183798Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.1185568Z 2025-05-07T20:00:10.1187054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.1188786Z 2025-05-07T20:00:10.1190316Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.1192045Z 2025-05-07T20:00:10.5485540Z [390/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:10.5506923Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.5508702Z 2025-05-07T20:00:10.5510108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.5511821Z 2025-05-07T20:00:10.5513555Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.5515480Z 2025-05-07T20:00:10.5517075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.5518908Z 2025-05-07T20:00:10.5520400Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.5522306Z 2025-05-07T20:00:10.5523926Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.5525758Z 2025-05-07T20:00:10.6957994Z [391/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o 2025-05-07T20:00:10.6979313Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.6980953Z 2025-05-07T20:00:10.6982500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.6984382Z 2025-05-07T20:00:10.6985935Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.6987824Z 2025-05-07T20:00:10.6989382Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.6991069Z 2025-05-07T20:00:10.6992813Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.6994625Z 2025-05-07T20:00:10.6996164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:10.6997882Z 2025-05-07T20:00:11.3537264Z [392/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:11.3558971Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3560804Z 2025-05-07T20:00:11.3562415Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3564214Z 2025-05-07T20:00:11.3565802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3567535Z 2025-05-07T20:00:11.3568985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3570876Z 2025-05-07T20:00:11.3572579Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3574397Z 2025-05-07T20:00:11.3576046Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:11.3577919Z 2025-05-07T20:00:12.0662992Z [393/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:12.0684788Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.0686675Z 2025-05-07T20:00:12.0688362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.0690238Z 2025-05-07T20:00:12.0691886Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.0693804Z 2025-05-07T20:00:12.0695317Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.0697240Z 2025-05-07T20:00:12.0698685Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.0700776Z 2025-05-07T20:00:12.0702490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.0704305Z 2025-05-07T20:00:12.3636377Z [394/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:12.3658811Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.3660557Z 2025-05-07T20:00:12.3661944Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.3663573Z 2025-05-07T20:00:12.3665047Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.3666693Z 2025-05-07T20:00:12.3668142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.3669689Z 2025-05-07T20:00:12.3671334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.3673240Z 2025-05-07T20:00:12.3674613Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:12.3676127Z 2025-05-07T20:00:14.5218817Z [395/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:14.5241617Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:14.5243539Z 2025-05-07T20:00:14.5245237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:14.5246877Z 2025-05-07T20:00:14.5248431Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:14.5250345Z 2025-05-07T20:00:14.5251925Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:14.5254018Z 2025-05-07T20:00:14.5255574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:14.5257284Z 2025-05-07T20:00:14.5259258Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:14.5261038Z 2025-05-07T20:00:15.2001144Z [396/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:15.2023286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:15.2025082Z 2025-05-07T20:00:15.2026642Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:15.2028390Z 2025-05-07T20:00:15.2029947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:15.2032463Z 2025-05-07T20:00:15.2034108Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:15.2035901Z 2025-05-07T20:00:15.2037648Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:15.2039445Z 2025-05-07T20:00:15.2041001Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:15.2042808Z 2025-05-07T20:00:17.4705444Z [397/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_gwd_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o 2025-05-07T20:00:17.4727009Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.4728712Z 2025-05-07T20:00:17.4730238Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.4732573Z 2025-05-07T20:00:17.4734061Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.4735815Z 2025-05-07T20:00:17.4737622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.4739437Z 2025-05-07T20:00:17.4740924Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.4742605Z 2025-05-07T20:00:17.4744142Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:17.4746119Z 2025-05-07T20:00:19.7085994Z [398/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:19.7107678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7109877Z 2025-05-07T20:00:19.7111426Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7113280Z 2025-05-07T20:00:19.7114823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7116554Z 2025-05-07T20:00:19.7118500Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7120234Z 2025-05-07T20:00:19.7121761Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7123478Z 2025-05-07T20:00:19.7125077Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:19.7126968Z 2025-05-07T20:00:21.3125980Z [399/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:21.3149349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.3151291Z 2025-05-07T20:00:21.3153065Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.3154938Z 2025-05-07T20:00:21.3156974Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.3158905Z 2025-05-07T20:00:21.3160664Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.3162633Z 2025-05-07T20:00:21.3164362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.3166431Z 2025-05-07T20:00:21.3168164Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:21.3170104Z 2025-05-07T20:00:23.1136535Z [400/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:23.1160029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.1161901Z 2025-05-07T20:00:23.1164002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.1165961Z 2025-05-07T20:00:23.1167562Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.1169418Z 2025-05-07T20:00:23.1171001Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.1173196Z 2025-05-07T20:00:23.1174966Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.1176984Z 2025-05-07T20:00:23.1178779Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.1180806Z 2025-05-07T20:00:23.3299611Z [401/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_dense.cpp 2025-05-07T20:00:23.3319443Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:23.6921485Z [402/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:23.6945360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.6947294Z 2025-05-07T20:00:23.6948953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.6950970Z 2025-05-07T20:00:23.6952572Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.6954453Z 2025-05-07T20:00:23.6956140Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.6958026Z 2025-05-07T20:00:23.6959686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.6961949Z 2025-05-07T20:00:23.6963628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:23.6965472Z 2025-05-07T20:00:24.3406242Z [403/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:24.3429000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.3430929Z 2025-05-07T20:00:24.3433066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.3435085Z 2025-05-07T20:00:24.3436823Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.3438745Z 2025-05-07T20:00:24.3440370Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.3442182Z 2025-05-07T20:00:24.3444155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.3446100Z 2025-05-07T20:00:24.3447781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.3449760Z 2025-05-07T20:00:24.3704043Z [404/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:24.3726096Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.3727973Z 2025-05-07T20:00:24.3729521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.3731567Z 2025-05-07T20:00:24.3733153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.3735038Z 2025-05-07T20:00:24.3736768Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.3738989Z 2025-05-07T20:00:24.3740625Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.3742406Z 2025-05-07T20:00:24.3744210Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.3745980Z 2025-05-07T20:00:24.5435381Z [405/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:24.5458998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.5460971Z 2025-05-07T20:00:24.5462723Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.5464641Z 2025-05-07T20:00:24.5466278Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.5468536Z 2025-05-07T20:00:24.5470321Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.5472342Z 2025-05-07T20:00:24.5474207Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.5476260Z 2025-05-07T20:00:24.5477973Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.5479931Z 2025-05-07T20:00:24.8873673Z [406/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:24.8896860Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.8898844Z 2025-05-07T20:00:24.8900618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.8902527Z 2025-05-07T20:00:24.8904459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.8906294Z 2025-05-07T20:00:24.8908044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.8909995Z 2025-05-07T20:00:24.8911873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.8913896Z 2025-05-07T20:00:24.8915565Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:24.8917475Z 2025-05-07T20:00:26.1979104Z [407/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:26.2003527Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.2005470Z 2025-05-07T20:00:26.2007068Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.2009324Z 2025-05-07T20:00:26.2011021Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.2012888Z 2025-05-07T20:00:26.2014778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.2016643Z 2025-05-07T20:00:26.2018254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.2020061Z 2025-05-07T20:00:26.2021737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.2023615Z 2025-05-07T20:00:26.6224813Z [408/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:26.6248976Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6251385Z 2025-05-07T20:00:26.6253055Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6255043Z 2025-05-07T20:00:26.6257000Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6258790Z 2025-05-07T20:00:26.6260434Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6262318Z 2025-05-07T20:00:26.6263955Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6265849Z 2025-05-07T20:00:26.6267626Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6269560Z 2025-05-07T20:00:26.6431104Z [409/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:26.6455492Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6457540Z 2025-05-07T20:00:26.6459247Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6461560Z 2025-05-07T20:00:26.6463232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6464998Z 2025-05-07T20:00:26.6466716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6468640Z 2025-05-07T20:00:26.6470335Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6472398Z 2025-05-07T20:00:26.6474294Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.6476232Z 2025-05-07T20:00:26.9417630Z [410/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:26.9437393Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9439005Z 2025-05-07T20:00:26.9440999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9442685Z 2025-05-07T20:00:26.9444153Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9445851Z 2025-05-07T20:00:26.9447367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9449184Z 2025-05-07T20:00:26.9450690Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9452491Z 2025-05-07T20:00:26.9454090Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:26.9455922Z 2025-05-07T20:00:27.2775710Z [411/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:27.2801155Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.2803106Z 2025-05-07T20:00:27.2804834Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.2806766Z 2025-05-07T20:00:27.2808552Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.2810722Z 2025-05-07T20:00:27.2812526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.2814557Z 2025-05-07T20:00:27.2816329Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.2818277Z 2025-05-07T20:00:27.2820080Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.2822001Z 2025-05-07T20:00:27.7865146Z [412/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:27.7885584Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.7887408Z 2025-05-07T20:00:27.7888982Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.7891045Z 2025-05-07T20:00:27.7892586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.7894276Z 2025-05-07T20:00:27.7895833Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.7897589Z 2025-05-07T20:00:27.7899184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.7901065Z 2025-05-07T20:00:27.7902623Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:27.7904395Z 2025-05-07T20:00:28.6672843Z [413/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:28.6697158Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.6699366Z 2025-05-07T20:00:28.6701123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.6702992Z 2025-05-07T20:00:28.6704686Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.6706566Z 2025-05-07T20:00:28.6708290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.6710261Z 2025-05-07T20:00:28.6711954Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.6713962Z 2025-05-07T20:00:28.6715655Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:28.6717580Z 2025-05-07T20:00:29.0396481Z [414/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:29.0417726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0419757Z 2025-05-07T20:00:29.0421379Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0423149Z 2025-05-07T20:00:29.0424726Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0426428Z 2025-05-07T20:00:29.0427948Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0429741Z 2025-05-07T20:00:29.0431680Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0433580Z 2025-05-07T20:00:29.0435118Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.0436828Z 2025-05-07T20:00:29.4281942Z [415/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:29.4294216Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.4295197Z 2025-05-07T20:00:29.4296084Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.4297068Z 2025-05-07T20:00:29.4297917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.4298902Z 2025-05-07T20:00:29.4299766Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.4300755Z 2025-05-07T20:00:29.4301620Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.4302578Z 2025-05-07T20:00:29.4303451Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:29.4304442Z 2025-05-07T20:00:30.5493039Z [416/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:30.5528115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:30.5529914Z 2025-05-07T20:00:30.5531879Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:30.5533711Z 2025-05-07T20:00:30.5535287Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:30.5537084Z 2025-05-07T20:00:30.5538709Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:30.5540535Z 2025-05-07T20:00:30.5542127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:30.5543968Z 2025-05-07T20:00:30.5545504Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:30.5547295Z 2025-05-07T20:00:31.5109798Z [417/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adagrad.cpp 2025-05-07T20:00:31.5127434Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:32.5928173Z [418/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o 2025-05-07T20:00:32.5949975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.5951770Z 2025-05-07T20:00:32.5953344Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.5955416Z 2025-05-07T20:00:32.5956731Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:32.5958185Z 2025-05-07T20:00:32.5959719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.5961434Z 2025-05-07T20:00:32.5962947Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.5964894Z 2025-05-07T20:00:32.5966166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu(245): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:32.5967615Z 2025-05-07T20:00:32.5969105Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.5970838Z 2025-05-07T20:00:32.5972373Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:32.5974164Z 2025-05-07T20:00:33.9820681Z [419/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o 2025-05-07T20:00:33.9841508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.9843241Z 2025-05-07T20:00:33.9844732Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.9846781Z 2025-05-07T20:00:33.9848277Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.9849932Z 2025-05-07T20:00:33.9851459Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.9853155Z 2025-05-07T20:00:33.9854595Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.9856328Z 2025-05-07T20:00:33.9857797Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:33.9859516Z 2025-05-07T20:00:34.1456870Z [420/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:34.1477743Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.1479734Z 2025-05-07T20:00:34.1481276Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.1482972Z 2025-05-07T20:00:34.1484420Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.1486066Z 2025-05-07T20:00:34.1487586Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.1489274Z 2025-05-07T20:00:34.1490792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.1492444Z 2025-05-07T20:00:34.1493917Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:34.1495544Z 2025-05-07T20:00:35.3159431Z [421/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_weighted_meta.cpp 2025-05-07T20:00:35.3178523Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:35.6947972Z [422/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_sgd.cpp 2025-05-07T20:00:35.6963434Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:35.7382039Z [423/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_sgd.cpp 2025-05-07T20:00:35.7396499Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:35.9342823Z [424/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad.cpp 2025-05-07T20:00:35.9362218Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:35.9827642Z [425/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adagrad_split_unweighted_meta.cpp 2025-05-07T20:00:35.9848340Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:36.1554453Z [426/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp 2025-05-07T20:00:36.1574356Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:36.3280019Z [427/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp 2025-05-07T20:00:36.3290663Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:36.5814909Z [428/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:00:36.5825596Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:37.1295141Z [429/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_weighted_meta.cpp 2025-05-07T20:00:37.1314283Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:37.4354579Z [430/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp 2025-05-07T20:00:37.4373147Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:37.9104033Z [431/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp 2025-05-07T20:00:37.9114742Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:37.9352959Z [432/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_sgd_split_unweighted_meta.cpp 2025-05-07T20:00:37.9363367Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:38.0928241Z [433/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp 2025-05-07T20:00:38.0947947Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:38.3481027Z [434/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lamb.cpp 2025-05-07T20:00:38.3499274Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:38.8233932Z [435/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:38.8258638Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.8260574Z 2025-05-07T20:00:38.8262309Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.8264284Z 2025-05-07T20:00:38.8265979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.8267913Z 2025-05-07T20:00:38.8269649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.8271595Z 2025-05-07T20:00:38.8273465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.8275383Z 2025-05-07T20:00:38.8277103Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:38.8279088Z 2025-05-07T20:00:38.8369641Z [436/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_weighted_meta.cpp 2025-05-07T20:00:38.8389846Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:39.0232105Z [437/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_lars_sgd.cpp 2025-05-07T20:00:39.0251938Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:39.6029817Z [438/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_forward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o 2025-05-07T20:00:39.6042190Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.6043213Z 2025-05-07T20:00:39.6044086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.6045067Z 2025-05-07T20:00:39.6045808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:39.6046647Z 2025-05-07T20:00:39.6047499Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.6048472Z 2025-05-07T20:00:39.6049330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.6050305Z 2025-05-07T20:00:39.6051045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu(236): warning #177-D: variable "offset_idx" was declared but never referenced 2025-05-07T20:00:39.6051876Z 2025-05-07T20:00:39.6052745Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.6053810Z 2025-05-07T20:00:39.6054701Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:39.6055690Z 2025-05-07T20:00:39.9169413Z [439/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_meta.cpp 2025-05-07T20:00:39.9187229Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:40.6966750Z [440/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_weighted_meta.cpp 2025-05-07T20:00:40.6988089Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:40.8743397Z [441/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_forward.so -o fbgemm_gpu_tbe_training_forward.so CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cpu_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_pt2_cuda_wrapper.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_weighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_weighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_v2_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_vbe_gwd_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_vbe_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_split_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_dense_unweighted_nobag_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_training_forward.dir/gen_embedding_forward_ssd_unweighted_nobag_kernel_small.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_common.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:00:40.9765297Z [442/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_adam.cpp 2025-05-07T20:00:40.9784102Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:41.1042428Z [443/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o 2025-05-07T20:00:41.1064990Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1066775Z 2025-05-07T20:00:41.1068366Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1070144Z 2025-05-07T20:00:41.1071671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1073580Z 2025-05-07T20:00:41.1075045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1076678Z 2025-05-07T20:00:41.1078204Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1079945Z 2025-05-07T20:00:41.1081490Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:41.1083147Z 2025-05-07T20:00:41.4526437Z [444/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_adam.cpp 2025-05-07T20:00:41.4537086Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:41.5027053Z [445/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_none.cpp 2025-05-07T20:00:41.5045057Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:42.0801474Z [446/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o 2025-05-07T20:00:42.0822546Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.0824046Z 2025-05-07T20:00:42.0825268Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.0826652Z 2025-05-07T20:00:42.0827930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.0829557Z 2025-05-07T20:00:42.0831129Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.0833286Z 2025-05-07T20:00:42.0834803Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.0836545Z 2025-05-07T20:00:42.0838099Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:42.0839818Z 2025-05-07T20:00:42.1586888Z [447/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp 2025-05-07T20:00:42.1605088Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:42.2310831Z [448/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_ops.cpp 2025-05-07T20:00:42.2329231Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:42.7552736Z [449/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lamb_split_unweighted_meta.cpp 2025-05-07T20:00:42.7570192Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:43.4643129Z [450/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp 2025-05-07T20:00:43.4662009Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:43.5374944Z [451/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_partial_rowwise_lamb.cpp 2025-05-07T20:00:43.5394226Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:43.6265442Z [452/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad.cpp 2025-05-07T20:00:43.6284797Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:44.1169220Z [453/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp 2025-05-07T20:00:44.1189099Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:44.1565788Z [454/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp 2025-05-07T20:00:44.1585078Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:44.3358476Z [455/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp 2025-05-07T20:00:44.3377811Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:44.3636917Z [456/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp 2025-05-07T20:00:44.3656596Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:44.4794382Z [457/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_weighted_meta.cpp 2025-05-07T20:00:44.4812807Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:44.5322862Z [458/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_none_split_unweighted_meta.cpp 2025-05-07T20:00:44.5342362Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:45.1358374Z [459/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp 2025-05-07T20:00:45.1379840Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:45.6902398Z [460/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp 2025-05-07T20:00:45.6922232Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:48.6639376Z [461/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cpp 2025-05-07T20:00:48.6654509Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:49.5796556Z [462/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cpp 2025-05-07T20:00:49.5811946Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:49.6813089Z [463/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_ssd_rowwise_adagrad.cpp 2025-05-07T20:00:49.6838358Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:50.2156187Z [464/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_models.cpp 2025-05-07T20:00:50.2173859Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:50.5745773Z [465/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp 2025-05-07T20:00:50.5765165Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:50.6430323Z [466/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_split_host_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp 2025-05-07T20:00:50.6452207Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:51.3910352Z [467/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_split_host.so -o fbgemm_gpu_tbe_training_backward_split_host.so CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_lars_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_adam.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_partial_rowwise_lamb.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_none.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_sgd.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_split_rowwise_weighted_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_ssd_rowwise_adagrad.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_lars_sgd_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_none_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_split_host.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_meta.cpp.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so && : 2025-05-07T20:00:52.2739636Z [468/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp 2025-05-07T20:00:52.2758468Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:52.5169551Z [469/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp 2025-05-07T20:00:52.5187333Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:52.5817815Z [470/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_cpu.cpp 2025-05-07T20:00:52.5836396Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:53.9596933Z [471/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_meta.cpp 2025-05-07T20:00:53.9613188Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:54.5827825Z [472/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp 2025-05-07T20:00:54.5845644Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:54.6393084Z [473/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp 2025-05-07T20:00:54.6410089Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:54.6773380Z [474/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_cpu.cpp 2025-05-07T20:00:54.6790118Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:54.8876338Z [475/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cpp 2025-05-07T20:00:54.8893498Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:55.0857897Z [476/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/codegen/training/index_select/batch_index_select_dim0_host.cpp 2025-05-07T20:00:55.0875833Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:55.1787494Z [477/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp 2025-05-07T20:00:55.1805838Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:55.3246972Z [478/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp 2025-05-07T20:00:55.3265579Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:56.2990554Z [479/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp 2025-05-07T20:00:56.3008038Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:56.7342674Z [480/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_cpu.cpp 2025-05-07T20:00:56.7359418Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:57.2190198Z [481/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp 2025-05-07T20:00:57.2208129Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:57.4472032Z [482/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/topology_utils.cpp 2025-05-07T20:00:57.4488392Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:57.5060362Z [483/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o 2025-05-07T20:00:57.5082522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.5084305Z 2025-05-07T20:00:57.5085802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.5087626Z 2025-05-07T20:00:57.5089127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.5090801Z 2025-05-07T20:00:57.5092334Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.5093982Z 2025-05-07T20:00:57.5095494Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:57.5097182Z 2025-05-07T20:00:58.4265218Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:00:58.4267401Z 2025-05-07T20:00:58.4282554Z [484/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_meta.cpp 2025-05-07T20:00:58.4298780Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:00:59.0189364Z [485/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp 2025-05-07T20:00:59.0206744Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:00.2904625Z [486/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:00.2925300Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2927041Z 2025-05-07T20:01:00.2928533Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2930239Z 2025-05-07T20:01:00.2932074Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2933764Z 2025-05-07T20:01:00.2935282Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2937161Z 2025-05-07T20:01:00.2938637Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2940337Z 2025-05-07T20:01:00.2941717Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:00.2943398Z 2025-05-07T20:01:01.1744587Z [487/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_vbe_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o 2025-05-07T20:01:01.1770023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.1772125Z 2025-05-07T20:01:01.1773913Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.1775845Z 2025-05-07T20:01:01.1777526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.1779759Z 2025-05-07T20:01:01.1781473Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.1783437Z 2025-05-07T20:01:01.1785104Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.1786967Z 2025-05-07T20:01:01.1788863Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:01.1790717Z 2025-05-07T20:01:01.9786450Z [488/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops_gpu.cpp 2025-05-07T20:01:01.9804982Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:02.0858518Z [489/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:02.0882820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0884780Z 2025-05-07T20:01:02.0886463Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0888491Z 2025-05-07T20:01:02.0890254Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0892063Z 2025-05-07T20:01:02.0893729Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0895766Z 2025-05-07T20:01:02.0897535Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0899337Z 2025-05-07T20:01:02.0901004Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.0903019Z 2025-05-07T20:01:02.2253081Z [490/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:01:02.2275326Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.2277155Z 2025-05-07T20:01:02.2278748Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.2280617Z 2025-05-07T20:01:02.2282195Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.2284002Z 2025-05-07T20:01:02.2285580Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.2287339Z 2025-05-07T20:01:02.2288921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.2290536Z 2025-05-07T20:01:02.2292015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:02.2293712Z 2025-05-07T20:01:02.7651133Z [491/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops_host.cpp 2025-05-07T20:01:02.7670479Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:03.2905358Z [492/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine_gpu.cpp 2025-05-07T20:01:03.2922989Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:03.6638513Z [493/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/eeg_utils.cpp 2025-05-07T20:01:03.6655382Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:04.4072245Z [494/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_ops_gpu.cpp 2025-05-07T20:01:04.4089624Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:04.7116754Z [495/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator_ops.cpp 2025-05-07T20:01:04.7134878Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:04.9214958Z [496/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_estimator.cpp 2025-05-07T20:01:04.9231817Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:05.2290246Z [497/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp 2025-05-07T20:01:05.2309574Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:05.9352287Z [498/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator_ops.cpp 2025-05-07T20:01:05.9370615Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:07.1843223Z [499/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp 2025-05-07T20:01:07.1861414Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:07.2511634Z [500/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/tbe/eeg/indices_generator.cpp 2025-05-07T20:01:07.2529186Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:08.5087359Z [501/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp 2025-05-07T20:01:08.5106869Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:08.5601162Z [502/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o 2025-05-07T20:01:08.5622361Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.5624344Z 2025-05-07T20:01:08.5625842Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.5627568Z 2025-05-07T20:01:08.5628999Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.5630694Z 2025-05-07T20:01:08.5632614Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.5634356Z 2025-05-07T20:01:08.5635839Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.5637515Z 2025-05-07T20:01:08.5639044Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:08.5640756Z 2025-05-07T20:01:09.9686403Z [503/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp 2025-05-07T20:01:09.9703838Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:10.2570133Z [504/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp 2025-05-07T20:01:10.2588752Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:12.6963699Z [505/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_gpu.cpp 2025-05-07T20:01:12.6981520Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:12.8525306Z [506/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o 2025-05-07T20:01:12.8544607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.8546570Z 2025-05-07T20:01:12.8547911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.8549441Z 2025-05-07T20:01:12.8551015Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.8552642Z 2025-05-07T20:01:12.8554013Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.8555546Z 2025-05-07T20:01:12.8556875Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.8558425Z 2025-05-07T20:01:12.8559727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:12.8561176Z 2025-05-07T20:01:14.5560685Z [507/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o 2025-05-07T20:01:14.5573039Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5574024Z 2025-05-07T20:01:14.5574907Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5575890Z 2025-05-07T20:01:14.5577010Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5578010Z 2025-05-07T20:01:14.5578871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5579860Z 2025-05-07T20:01:14.5580711Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5581756Z 2025-05-07T20:01:14.5582682Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:14.5583663Z 2025-05-07T20:01:15.0465311Z [508/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o 2025-05-07T20:01:15.0488872Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.0490766Z 2025-05-07T20:01:15.0492774Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.0494691Z 2025-05-07T20:01:15.0496338Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.0498236Z 2025-05-07T20:01:15.0499927Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.0501932Z 2025-05-07T20:01:15.0503618Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.0505488Z 2025-05-07T20:01:15.0507166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.0509082Z 2025-05-07T20:01:15.3097994Z [509/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_weighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o 2025-05-07T20:01:15.3110205Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.3111200Z 2025-05-07T20:01:15.3112078Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.3113218Z 2025-05-07T20:01:15.3114086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.3115158Z 2025-05-07T20:01:15.3116022Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.3117011Z 2025-05-07T20:01:15.3117871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.3118844Z 2025-05-07T20:01:15.3119716Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:15.3120702Z 2025-05-07T20:01:16.8130850Z [510/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:01:16.8151554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:16.8153432Z 2025-05-07T20:01:16.8155075Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:16.8156798Z 2025-05-07T20:01:16.8158279Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:16.8160199Z 2025-05-07T20:01:16.8161740Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:16.8163511Z 2025-05-07T20:01:16.8164998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:16.8166626Z 2025-05-07T20:01:16.8168085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:16.8169730Z 2025-05-07T20:01:17.0371069Z [511/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_ops_cpu.cpp 2025-05-07T20:01:17.0387791Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:17.0533324Z [512/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o 2025-05-07T20:01:17.0555404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.0557250Z 2025-05-07T20:01:17.0558832Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.0560694Z 2025-05-07T20:01:17.0562298Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.0564041Z 2025-05-07T20:01:17.0565616Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.0567606Z 2025-05-07T20:01:17.0569072Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.0570711Z 2025-05-07T20:01:17.0572221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.0573992Z 2025-05-07T20:01:17.6080042Z [513/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_dense_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_dense_split_unweighted_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o 2025-05-07T20:01:17.6097038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.6098427Z 2025-05-07T20:01:17.6099600Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.6101014Z 2025-05-07T20:01:17.6102145Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.6103485Z 2025-05-07T20:01:17.6104678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.6106220Z 2025-05-07T20:01:17.6107429Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.6108756Z 2025-05-07T20:01:17.6111506Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.6113073Z 2025-05-07T20:01:17.9244346Z [514/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_split_grad_index_select.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o 2025-05-07T20:01:17.9264045Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.9265619Z 2025-05-07T20:01:17.9266959Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.9268551Z 2025-05-07T20:01:17.9269958Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.9271670Z 2025-05-07T20:01:17.9273038Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.9274555Z 2025-05-07T20:01:17.9275905Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.9277453Z 2025-05-07T20:01:17.9278997Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:17.9280536Z 2025-05-07T20:01:19.7383814Z [515/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_forward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o 2025-05-07T20:01:19.7402311Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.7403783Z 2025-05-07T20:01:19.7405243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.7406862Z 2025-05-07T20:01:19.7408223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.7409866Z 2025-05-07T20:01:19.7411064Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.7412506Z 2025-05-07T20:01:19.7414054Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.7415699Z 2025-05-07T20:01:19.7417094Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:19.7418782Z 2025-05-07T20:01:20.6921876Z [516/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o 2025-05-07T20:01:20.6936691Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.6937688Z 2025-05-07T20:01:20.6938556Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.6939701Z 2025-05-07T20:01:20.6940591Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.6941616Z 2025-05-07T20:01:20.6942632Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.6943650Z 2025-05-07T20:01:20.6944545Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.6945540Z 2025-05-07T20:01:20.6946430Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.6947505Z 2025-05-07T20:01:20.9313893Z [517/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_forward_kernel.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o 2025-05-07T20:01:20.9325391Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.9326389Z 2025-05-07T20:01:20.9327508Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.9328487Z 2025-05-07T20:01:20.9329360Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.9330319Z 2025-05-07T20:01:20.9331649Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.9332666Z 2025-05-07T20:01:20.9333522Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.9334503Z 2025-05-07T20:01:20.9335365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:20.9336457Z 2025-05-07T20:01:27.1790782Z [518/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/input_combine_ops/input_combine.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o 2025-05-07T20:01:27.4263432Z [519/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_forward_kernel_small.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o 2025-05-07T20:01:27.4279888Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:27.4281302Z 2025-05-07T20:01:27.4282543Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:27.4283944Z 2025-05-07T20:01:27.4285162Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:27.4286533Z 2025-05-07T20:01:27.4287778Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:27.4289171Z 2025-05-07T20:01:27.4290390Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:27.4291777Z 2025-05-07T20:01:27.4293008Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:27.4294412Z 2025-05-07T20:01:28.7900486Z [520/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o 2025-05-07T20:01:33.4541326Z [521/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/memory_utils/memory_utils_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o 2025-05-07T20:01:35.3075777Z [522/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_backward_kernel_warp.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o 2025-05-07T20:01:35.3087458Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.3088448Z 2025-05-07T20:01:35.3089332Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.3090312Z 2025-05-07T20:01:35.3091168Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.3092148Z 2025-05-07T20:01:35.3093012Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.3093994Z 2025-05-07T20:01:35.3094846Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.3095888Z 2025-05-07T20:01:35.3096762Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:35.3097742Z 2025-05-07T20:01:43.1552528Z [523/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/histogram_binning_calibration_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o 2025-05-07T20:01:45.2900367Z [524/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o 2025-05-07T20:01:46.5855448Z [525/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_backward_codegen_cuda.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o 2025-05-07T20:01:46.5877953Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.5879887Z 2025-05-07T20:01:46.5881576Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.5883660Z 2025-05-07T20:01:46.5885351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.5887241Z 2025-05-07T20:01:46.5889102Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.5891021Z 2025-05-07T20:01:46.5892700Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.5894645Z 2025-05-07T20:01:46.5896333Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:46.5898307Z 2025-05-07T20:01:47.2780646Z [526/608] /github/home/miniconda/envs/build_binary/bin/c++ -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -std=c++17 -fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations -mavx2 -mf16c -mfma -fopenmp -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o.d -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp 2025-05-07T20:01:47.2797636Z clang-16: warning: argument unused during compilation: '-L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib' [-Wunused-command-line-argument] 2025-05-07T20:01:48.3099189Z [527/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o 2025-05-07T20:01:48.7080186Z [528/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_index_select_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_batch_index_select_dim0_backward_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o 2025-05-07T20:01:48.7101910Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:48.7103851Z 2025-05-07T20:01:48.7105873Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:48.7107627Z 2025-05-07T20:01:48.7109086Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:48.7110862Z 2025-05-07T20:01:48.7112406Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:48.7114641Z 2025-05-07T20:01:48.7116124Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:48.7117900Z 2025-05-07T20:01:48.7119465Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:01:48.7121395Z 2025-05-07T20:01:48.7980874Z [529/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o 2025-05-07T20:01:49.3891426Z [530/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_index_select.so -o fbgemm_gpu_tbe_index_select.so CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_cpu_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_ops.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/codegen/training/index_select/batch_index_select_dim0_host.cpp.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_forward_kernel_small.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_batch_index_select_dim0_backward_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_index_select.dir/gen_embedding_backward_split_grad_index_select.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:01:50.0673882Z [531/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/metric_ops/metric_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o 2025-05-07T20:01:51.3932049Z [532/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o 2025-05-07T20:01:52.7809450Z [533/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o 2025-05-07T20:01:53.1297420Z [534/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o 2025-05-07T20:01:53.9402183Z [535/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_tensor_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o 2025-05-07T20:01:53.9540663Z [536/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:01:53.9542509Z ################################################################################ 2025-05-07T20:01:53.9543050Z [CMAKE] Running post-build script ... 2025-05-07T20:01:53.9544054Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:01:53.9544784Z Removing all RPATHs ... 2025-05-07T20:01:53.9545173Z ################################################################################ 2025-05-07T20:01:54.0329020Z [537/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 1 2025-05-07T20:01:54.0330780Z ################################################################################ 2025-05-07T20:01:54.0331820Z [CMAKE] Running post-build script ... 2025-05-07T20:01:54.0332570Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:01:54.0333350Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:54.0333882Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:54.0334491Z ################################################################################ 2025-05-07T20:01:54.1701009Z [538/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:01:54.1702731Z ################################################################################ 2025-05-07T20:01:54.1703199Z [CMAKE] Running post-build script ... 2025-05-07T20:01:54.1703945Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:01:54.1704701Z Removing all RPATHs ... 2025-05-07T20:01:54.1705053Z ################################################################################ 2025-05-07T20:01:54.2891671Z [539/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o 2025-05-07T20:01:54.2983924Z [540/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:01:54.2986250Z ################################################################################ 2025-05-07T20:01:54.2986858Z [CMAKE] Running post-build script ... 2025-05-07T20:01:54.2988115Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:01:54.2989110Z Removing all RPATHs ... 2025-05-07T20:01:54.2989616Z ################################################################################ 2025-05-07T20:01:54.4118628Z [541/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:01:54.4121970Z ################################################################################ 2025-05-07T20:01:54.4122839Z [CMAKE] Running post-build script ... 2025-05-07T20:01:54.4124149Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:01:54.4125518Z Removing all RPATHs ... 2025-05-07T20:01:54.4126157Z ################################################################################ 2025-05-07T20:01:54.4224377Z [542/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 1 2025-05-07T20:01:54.4227487Z ################################################################################ 2025-05-07T20:01:54.4228370Z [CMAKE] Running post-build script ... 2025-05-07T20:01:54.4229897Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:01:54.4231562Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:54.4232466Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:54.4233664Z ################################################################################ 2025-05-07T20:01:54.4347685Z [543/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 1 2025-05-07T20:01:54.4350842Z ################################################################################ 2025-05-07T20:01:54.4351706Z [CMAKE] Running post-build script ... 2025-05-07T20:01:54.4353180Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:01:54.4354590Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:54.4355456Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:54.4356438Z ################################################################################ 2025-05-07T20:01:54.5010443Z [544/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:01:54.5013365Z ################################################################################ 2025-05-07T20:01:54.5014141Z [CMAKE] Running post-build script ... 2025-05-07T20:01:54.5015473Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:01:54.5017134Z Removing all RPATHs ... 2025-05-07T20:01:54.5017744Z ################################################################################ 2025-05-07T20:01:54.9263807Z [545/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o 2025-05-07T20:01:55.2942672Z [546/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/layout_transform_ops/layout_transform_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o 2025-05-07T20:01:55.2970400Z [547/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 1 2025-05-07T20:01:55.2973473Z ################################################################################ 2025-05-07T20:01:55.2974291Z [CMAKE] Running post-build script ... 2025-05-07T20:01:55.2976061Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:01:55.2977654Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:55.2978509Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:55.2979481Z ################################################################################ 2025-05-07T20:01:55.4919667Z [548/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 1 2025-05-07T20:01:55.4923101Z ################################################################################ 2025-05-07T20:01:55.4923899Z [CMAKE] Running post-build script ... 2025-05-07T20:01:55.4925386Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:01:55.4926645Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:55.4927552Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:55.4928572Z ################################################################################ 2025-05-07T20:01:55.9118531Z [549/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 1 2025-05-07T20:01:55.9121120Z ################################################################################ 2025-05-07T20:01:55.9121797Z [CMAKE] Running post-build script ... 2025-05-07T20:01:55.9122882Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:01:55.9123938Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:55.9124670Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:55.9125454Z ################################################################################ 2025-05-07T20:01:56.2750518Z [550/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/dense_to_jagged_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o 2025-05-07T20:01:56.4909924Z [551/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o 2025-05-07T20:01:56.5158575Z [552/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_mx.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o 2025-05-07T20:01:56.7039409Z [553/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o 2025-05-07T20:01:56.7266247Z [554/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_softmax_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o 2025-05-07T20:01:57.1991573Z [555/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_unique_indices.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o 2025-05-07T20:01:57.2002430Z In file included from tmpxft_000097d7_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:1: 2025-05-07T20:01:57.2003683Z /tmp/tmpxft_000097d7_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:45:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.2009269Z static void __device_stub__ZN10fbgemm_gpu28unique_indices_length_kernelIlLl9223372036854775807ELln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_S5_S5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArg(__par2, 32UL);__cudaSetupArg(__par3, 48UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::unique_indices_length_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:01:57.2014745Z ^ 2025-05-07T20:01:57.2016242Z /tmp/tmpxft_000097d7_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:45:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.2017957Z /tmp/tmpxft_000097d7_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:45:1022: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.2019670Z /tmp/tmpxft_000097d7_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:51:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.2025485Z static void __device_stub__ZN10fbgemm_gpu24compute_hash_size_kernelIlLln9223372036854775808EEEvN2at27GenericPackedTensorAccessorIT_Lm1ENS1_17RestrictPtrTraitsEiEES5_lS5_(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par0, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par1, const int64_t __par2, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE&__par3){__cudaLaunchPrologue(4);__cudaSetupArg(__par0, 0UL);__cudaSetupArg(__par1, 16UL);__cudaSetupArgSimple(__par2, 32UL);__cudaSetupArg(__par3, 40UL);__cudaLaunch(((char *)((void ( *)(const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE, const int64_t, _ZN2at22PackedTensorAccessor32IlLm1ENS_17RestrictPtrTraitsEEE))fbgemm_gpu::compute_hash_size_kernel )));}namespace fbgemm_gpu{ 2025-05-07T20:01:57.2029930Z ^ 2025-05-07T20:01:57.2031651Z /tmp/tmpxft_000097d7_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:51:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.2033517Z /tmp/tmpxft_000097d7_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:51:860: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.2035261Z /tmp/tmpxft_000097d7_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:54:445: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.2036976Z /tmp/tmpxft_000097d7_00000000-6_jagged_unique_indices.compute_80.cudafe1.stub.c:54:1476: warning: integer literal is too large to be represented in a signed integer type, interpreting as unsigned [-Wimplicitly-unsigned-literal] 2025-05-07T20:01:57.2038005Z 8 warnings generated. 2025-05-07T20:01:57.9657366Z [556/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o 2025-05-07T20:01:58.3166095Z [557/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o 2025-05-07T20:01:58.7296779Z [558/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 1 2025-05-07T20:01:58.7299060Z ################################################################################ 2025-05-07T20:01:58.7299660Z [CMAKE] Running post-build script ... 2025-05-07T20:01:58.7300949Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:01:58.7301989Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:58.7302605Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:58.7303297Z ################################################################################ 2025-05-07T20:01:58.8996489Z [559/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 1 2025-05-07T20:01:58.8999009Z ################################################################################ 2025-05-07T20:01:58.8999666Z [CMAKE] Running post-build script ... 2025-05-07T20:01:58.9000755Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:01:58.9001868Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:01:58.9002584Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:01:58.9003375Z ################################################################################ 2025-05-07T20:01:59.4210352Z [560/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o 2025-05-07T20:02:01.3207406Z [561/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o 2025-05-07T20:02:01.3222528Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:01.3223421Z 2025-05-07T20:02:01.3224123Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:01.3224992Z 2025-05-07T20:02:01.3225722Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:01.3226590Z 2025-05-07T20:02:01.3227304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:01.3228185Z 2025-05-07T20:02:01.3228885Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:01.3229753Z 2025-05-07T20:02:01.3230470Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:01.3231579Z 2025-05-07T20:02:01.3232453Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:01.3233493Z 2025-05-07T20:02:01.3234383Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:01.3235260Z 2025-05-07T20:02:01.3235980Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:01.3236840Z 2025-05-07T20:02:01.3237537Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(61): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:01.3238501Z 2025-05-07T20:02:01.3239208Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(121): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:01.3240071Z 2025-05-07T20:02:01.3240792Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fp8_rowwise.cu(161): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:01.3241665Z 2025-05-07T20:02:01.7370818Z [562/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_bfloat16.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o 2025-05-07T20:02:02.3733212Z [563/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o 2025-05-07T20:02:02.3748505Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.3749432Z 2025-05-07T20:02:02.3750166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.3751229Z 2025-05-07T20:02:02.3751977Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.3752967Z 2025-05-07T20:02:02.3753702Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.3754601Z 2025-05-07T20:02:02.3755330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.3756220Z 2025-05-07T20:02:02.3756965Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_nbit_rowwise.cu(78): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.3757845Z 2025-05-07T20:02:02.5434186Z [564/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o 2025-05-07T20:02:02.8802418Z [565/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o 2025-05-07T20:02:02.8817574Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8818503Z 2025-05-07T20:02:02.8819229Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8820126Z 2025-05-07T20:02:02.8821029Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8821936Z 2025-05-07T20:02:02.8822671Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8823570Z 2025-05-07T20:02:02.8824301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8825187Z 2025-05-07T20:02:02.8825928Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8826928Z 2025-05-07T20:02:02.8827647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8828550Z 2025-05-07T20:02:02.8829330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8830230Z 2025-05-07T20:02:02.8831037Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8832304Z 2025-05-07T20:02:02.8833184Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8834070Z 2025-05-07T20:02:02.8834808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8835701Z 2025-05-07T20:02:02.8836423Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(52): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8837451Z 2025-05-07T20:02:02.8838201Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(118): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8839087Z 2025-05-07T20:02:02.8839821Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(154): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8840718Z 2025-05-07T20:02:02.8841450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_fused_8bit_rowwise.cu(195): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:02.8842338Z 2025-05-07T20:02:03.7739863Z [566/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o 2025-05-07T20:02:03.7755146Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7756055Z 2025-05-07T20:02:03.7756949Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7757868Z 2025-05-07T20:02:03.7758602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7759492Z 2025-05-07T20:02:03.7760239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7761138Z 2025-05-07T20:02:03.7761865Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7762891Z 2025-05-07T20:02:03.7763622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7764516Z 2025-05-07T20:02:03.7765243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7766126Z 2025-05-07T20:02:03.7766855Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7767757Z 2025-05-07T20:02:03.7768496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7769398Z 2025-05-07T20:02:03.7770127Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7771022Z 2025-05-07T20:02:03.7771757Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(19): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7772632Z 2025-05-07T20:02:03.7773362Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(94): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7774258Z 2025-05-07T20:02:03.7774985Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(110): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7775877Z 2025-05-07T20:02:03.7776628Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(137): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7777510Z 2025-05-07T20:02:03.7778330Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_padded_fp8_rowwise.cu(166): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:03.7779240Z 2025-05-07T20:02:06.4591955Z [567/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o 2025-05-07T20:02:07.0041589Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:07.0042242Z 2025-05-07T20:02:07.0042727Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:07.0043372Z 2025-05-07T20:02:07.0043850Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_msfp.cu(73): warning #20050-D: inline qualifier ignored for "__global__" function 2025-05-07T20:02:07.0044448Z 2025-05-07T20:02:07.0054498Z [568/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/quantize_ops/quantize_hfp8.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o 2025-05-07T20:02:10.0683330Z [569/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o 2025-05-07T20:02:10.0694851Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.0695826Z 2025-05-07T20:02:10.0696706Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.0697756Z 2025-05-07T20:02:10.0698611Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.0699589Z 2025-05-07T20:02:10.0700460Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.0701429Z 2025-05-07T20:02:10.0702351Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.0703317Z 2025-05-07T20:02:10.0704187Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.0705208Z 2025-05-07T20:02:10.2335679Z [570/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_batched_unary_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o 2025-05-07T20:02:10.2346930Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.2347927Z 2025-05-07T20:02:10.2348868Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.2349851Z 2025-05-07T20:02:10.2350719Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.2351818Z 2025-05-07T20:02:10.2352781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.2353856Z 2025-05-07T20:02:10.2354733Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.2355699Z 2025-05-07T20:02:10.2356583Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.2357552Z 2025-05-07T20:02:10.5161995Z [571/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o 2025-05-07T20:02:10.5173161Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.5174159Z 2025-05-07T20:02:10.5175100Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.5176105Z 2025-05-07T20:02:10.5176647Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:02:10.5177296Z 2025-05-07T20:02:10.5178166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.5179125Z 2025-05-07T20:02:10.5179987Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.5181039Z 2025-05-07T20:02:10.5181582Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_async_batched_cumsum.cu(16): warning #177-D: variable "kMaxThreads" was declared but never referenced 2025-05-07T20:02:10.5182231Z 2025-05-07T20:02:10.5183138Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.5184104Z 2025-05-07T20:02:10.5184975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:10.5185952Z 2025-05-07T20:02:13.2463948Z [572/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o 2025-05-07T20:02:14.3431640Z [573/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_compute_frequency_sequence.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o 2025-05-07T20:02:14.3443223Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.3444231Z 2025-05-07T20:02:14.3445106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.3446097Z 2025-05-07T20:02:14.3446961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.3447922Z 2025-05-07T20:02:14.3448802Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.3449769Z 2025-05-07T20:02:14.3450621Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.3451599Z 2025-05-07T20:02:14.3452466Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:14.3453456Z 2025-05-07T20:02:14.5489611Z [574/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_embedding_inplace_ops_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -MF CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/embedding_inplace_ops/embedding_inplace_update.cu -o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o 2025-05-07T20:02:15.1136452Z [575/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_embedding_inplace_ops.so -o fbgemm_gpu_embedding_inplace_ops.so CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_cpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update_gpu.cpp.o CMakeFiles/fbgemm_gpu_embedding_inplace_ops.dir/src/embedding_inplace_ops/embedding_inplace_update.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,/lib/intel64:/lib/intel64_win:/lib/win-x64:/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib:/github/home/miniconda/envs/build_binary/lib/stubs: /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed -lcudadevrt -lcudart_static -ldl && : 2025-05-07T20:02:15.1222102Z [576/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:02:15.1223448Z ################################################################################ 2025-05-07T20:02:15.1223829Z [CMAKE] Running post-build script ... 2025-05-07T20:02:15.1224484Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:02:15.1225119Z Removing all RPATHs ... 2025-05-07T20:02:15.1225445Z ################################################################################ 2025-05-07T20:02:15.8323879Z [577/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_expand_into_jagged_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o 2025-05-07T20:02:15.8335372Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.8336352Z 2025-05-07T20:02:15.8337239Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.8338320Z 2025-05-07T20:02:15.8339182Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.8340172Z 2025-05-07T20:02:15.8341043Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.8342014Z 2025-05-07T20:02:15.8342961Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.8343929Z 2025-05-07T20:02:15.8344794Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:15.8345783Z 2025-05-07T20:02:17.0449320Z [578/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_group_index.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o 2025-05-07T20:02:17.0460408Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.0461409Z 2025-05-07T20:02:17.0462299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.0463338Z 2025-05-07T20:02:17.0464189Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.0465162Z 2025-05-07T20:02:17.0466023Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.0467002Z 2025-05-07T20:02:17.0467918Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.0468887Z 2025-05-07T20:02:17.0469758Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:17.0470727Z 2025-05-07T20:02:19.6316489Z [579/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o 2025-05-07T20:02:20.0289824Z [580/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_block_bucketize_features.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o 2025-05-07T20:02:20.0302439Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.0303440Z 2025-05-07T20:02:20.0304307Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.0305292Z 2025-05-07T20:02:20.0306165Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.0307132Z 2025-05-07T20:02:20.0307991Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.0308970Z 2025-05-07T20:02:20.0309820Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.0310785Z 2025-05-07T20:02:20.0311728Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:20.0312816Z 2025-05-07T20:02:21.5292447Z [581/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_add.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o 2025-05-07T20:02:21.5303737Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:21.5304725Z 2025-05-07T20:02:21.5305602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:21.5306593Z 2025-05-07T20:02:21.5307450Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:21.5308425Z 2025-05-07T20:02:21.5309286Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:21.5310258Z 2025-05-07T20:02:21.5311141Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:21.5312099Z 2025-05-07T20:02:21.5313115Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:21.5314124Z 2025-05-07T20:02:23.1766328Z [582/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_index_select.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o 2025-05-07T20:02:23.1783368Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.1785088Z 2025-05-07T20:02:23.1786438Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.1787951Z 2025-05-07T20:02:23.1789232Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.1790641Z 2025-05-07T20:02:23.1792101Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.1793707Z 2025-05-07T20:02:23.1795085Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.1796601Z 2025-05-07T20:02:23.1798154Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:23.1799733Z 2025-05-07T20:02:30.1466303Z [583/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_invert_permute.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o 2025-05-07T20:02:30.1486901Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.1488759Z 2025-05-07T20:02:30.1490264Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.1492041Z 2025-05-07T20:02:30.1493607Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.1495299Z 2025-05-07T20:02:30.1496911Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.1498730Z 2025-05-07T20:02:30.1500301Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.1502122Z 2025-05-07T20:02:30.1503922Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:30.1505687Z 2025-05-07T20:02:33.3433433Z [584/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_backward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o 2025-05-07T20:02:33.3457237Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:33.3459169Z 2025-05-07T20:02:33.3460895Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:33.3462855Z 2025-05-07T20:02:33.3464554Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:33.3466482Z 2025-05-07T20:02:33.3468212Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:33.3470077Z 2025-05-07T20:02:33.3471781Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:33.3473776Z 2025-05-07T20:02:33.3475622Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:33.3477574Z 2025-05-07T20:02:35.3850376Z [585/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute102.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o 2025-05-07T20:02:35.3861538Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:35.3862519Z 2025-05-07T20:02:35.3863398Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:35.3864415Z 2025-05-07T20:02:35.3865290Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:35.3866263Z 2025-05-07T20:02:35.3867163Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:35.3868143Z 2025-05-07T20:02:35.3869002Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:35.3869993Z 2025-05-07T20:02:35.3870921Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:35.3871920Z 2025-05-07T20:02:36.3250356Z [586/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_embeddings.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o 2025-05-07T20:02:36.3261747Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.3262760Z 2025-05-07T20:02:36.3263634Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.3264612Z 2025-05-07T20:02:36.3265489Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.3266459Z 2025-05-07T20:02:36.3267349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.3268329Z 2025-05-07T20:02:36.3269185Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.3270182Z 2025-05-07T20:02:36.3271106Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.3272087Z 2025-05-07T20:02:36.5092761Z [587/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_pack_segments_forward.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o 2025-05-07T20:02:36.5104186Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.5105163Z 2025-05-07T20:02:36.5106052Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.5107031Z 2025-05-07T20:02:36.5107887Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.5108864Z 2025-05-07T20:02:36.5109735Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.5110703Z 2025-05-07T20:02:36.5111602Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.5112646Z 2025-05-07T20:02:36.5113601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.5114585Z 2025-05-07T20:02:36.9188072Z [588/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_1d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o 2025-05-07T20:02:36.9199389Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.9200383Z 2025-05-07T20:02:36.9201284Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.9202271Z 2025-05-07T20:02:36.9203132Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.9204124Z 2025-05-07T20:02:36.9205019Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.9205998Z 2025-05-07T20:02:36.9206883Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.9207859Z 2025-05-07T20:02:36.9208785Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:36.9209793Z 2025-05-07T20:02:37.1009473Z [589/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_range.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o 2025-05-07T20:02:37.1020675Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.1021690Z 2025-05-07T20:02:37.1022567Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.1023558Z 2025-05-07T20:02:37.1024452Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.1025423Z 2025-05-07T20:02:37.1026299Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.1027296Z 2025-05-07T20:02:37.1028150Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.1029131Z 2025-05-07T20:02:37.1029998Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:37.1031067Z 2025-05-07T20:02:38.6324039Z [590/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_permute_2d.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o 2025-05-07T20:02:38.6335441Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.6336432Z 2025-05-07T20:02:38.6337304Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.6338281Z 2025-05-07T20:02:38.6339152Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.6340111Z 2025-05-07T20:02:38.6340975Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.6341956Z 2025-05-07T20:02:38.6342804Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.6343782Z 2025-05-07T20:02:38.6344640Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.6345616Z 2025-05-07T20:02:38.9856850Z [591/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_zipf.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o 2025-05-07T20:02:38.9867979Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.9868945Z 2025-05-07T20:02:38.9869807Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.9870767Z 2025-05-07T20:02:38.9871601Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.9872644Z 2025-05-07T20:02:38.9873679Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.9874651Z 2025-05-07T20:02:38.9875520Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.9876481Z 2025-05-07T20:02:38.9877339Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:38.9878321Z 2025-05-07T20:02:40.3520089Z [592/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_reorder_batched_ad.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o 2025-05-07T20:02:40.3531367Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3532511Z 2025-05-07T20:02:40.3533392Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3534368Z 2025-05-07T20:02:40.3535221Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3536201Z 2025-05-07T20:02:40.3537066Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3538048Z 2025-05-07T20:02:40.3538900Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3539874Z 2025-05-07T20:02:40.3540739Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3541707Z 2025-05-07T20:02:40.3612526Z [593/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/sparse_ops/sparse_segment_sum_csr.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o 2025-05-07T20:02:40.3623369Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3624326Z 2025-05-07T20:02:40.3625166Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3626133Z 2025-05-07T20:02:40.3626972Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3627903Z 2025-05-07T20:02:40.3628754Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3629694Z 2025-05-07T20:02:40.3630521Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3631934Z 2025-05-07T20:02:40.3632871Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:40.3633854Z 2025-05-07T20:02:44.6920650Z [594/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o 2025-05-07T20:02:44.6932808Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.6933809Z 2025-05-07T20:02:44.6934678Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.6935655Z 2025-05-07T20:02:44.6936526Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.6937487Z 2025-05-07T20:02:44.6938349Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.6939344Z 2025-05-07T20:02:44.6940196Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.6941176Z 2025-05-07T20:02:44.6942083Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:44.6943060Z 2025-05-07T20:02:45.7607365Z [595/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o 2025-05-07T20:02:45.7618931Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.7619838Z 2025-05-07T20:02:45.7620656Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.7621578Z 2025-05-07T20:02:45.7622365Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.7623270Z 2025-05-07T20:02:45.7624073Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.7624971Z 2025-05-07T20:02:45.7625826Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.7626719Z 2025-05-07T20:02:45.7627512Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:45.7628420Z 2025-05-07T20:02:50.5598188Z [596/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_tbe_training_backward_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o -MF CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu -o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o 2025-05-07T20:02:50.5609496Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.5610408Z 2025-05-07T20:02:50.5611243Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.5612146Z 2025-05-07T20:02:50.5612933Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.5613839Z 2025-05-07T20:02:50.5614703Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.5615602Z 2025-05-07T20:02:50.5616404Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __host__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.5617291Z 2025-05-07T20:02:50.5618095Z /__w/FBGEMM/FBGEMM/fbgemm_gpu/include/fbgemm_gpu/utils/stochastic_rounding.cuh(32): warning #20012-D: __device__ annotation is ignored on a function("StochasticRoundingRNGState") that is explicitly defaulted on its first declaration 2025-05-07T20:02:50.5619029Z 2025-05-07T20:02:50.5786762Z [597/608] /github/home/miniconda/envs/build_binary/bin/nvcc -forward-unknown-to-host-compiler -DUSE_C10D_GLOO -DUSE_C10D_NCCL -DUSE_DISTRIBUTED -DUSE_RPC -DUSE_TENSORPIPE -Dfbgemm_gpu_py_EXPORTS -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/asmjit/src -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cpuinfo/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/cutlass/tools/util/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/composable_kernel/include -I/__w/FBGEMM/FBGEMM/fbgemm_gpu/../external/json/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include -isystem /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -isystem /github/home/miniconda/envs/build_binary/include -DONNX_NAMESPACE=onnx_c2 -gencode arch=compute_70,code=sm_70 -gencode arch=compute_80,code=sm_80 -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --expt-relaxed-constexpr --expt-extended-lambda -O3 -DNDEBUG -std=c++17 -Xcompiler=-fPIC -Wno-deprecated-anon-enum-enum-conversion -Wno-deprecated-declarations --expt-relaxed-constexpr -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -MD -MT CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o -MF CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o.d -x cu -c /__w/FBGEMM/FBGEMM/fbgemm_gpu/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu -o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o 2025-05-07T20:02:51.4155484Z [598/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_py.so -o fbgemm_gpu_py.so CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_function.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_autograd.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_cpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_meta.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_models.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/eeg_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_estimator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator_ops.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/tbe/eeg/indices_generator.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops_host.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/merge_pooled_embedding_ops/merge_pooled_embedding_ops_gpu.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/topology_utils.cpp.o CMakeFiles/fbgemm_gpu_py.dir/src/histogram_binning_calibration_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/input_combine_ops/input_combine.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/intraining_embedding_pruning_ops/intraining_embedding_pruning.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/memory_utils/memory_utils_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/batched_dense_vec_jagged_2d_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/dense_to_jagged_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_dense_elementwise_add_jagged_output_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_dense_elementwise_mul_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_add_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_index_select_2d_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_jagged_bmm_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_softmax_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_tensor_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_to_padded_dense_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/jagged_unique_indices.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/jagged_tensor_ops/keyed_jagged_index_select_dim1.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/layout_transform_ops/layout_transform_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/metric_ops/metric_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops_split.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_pooled_embedding_ops/permute_pooled_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/permute_multi_embedding_ops/permute_multi_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_bfloat16.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_8bit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_fused_nbit_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_hfp8.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_msfp.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_padded_fp8_rowwise.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/quantize_ops/quantize_mx.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_async_batched_cumsum.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_block_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_bucketize_features.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_batched_unary_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_compute_frequency_sequence.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_expand_into_jagged_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_group_index.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_add.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_index_select.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_invert_permute.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_backward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_pack_segments_forward.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_1d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_2d.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute102.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_permute_embeddings.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_range.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_reorder_batched_ad.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_segment_sum_csr.cu.o CMakeFiles/fbgemm_gpu_py.dir/src/sparse_ops/sparse_zipf.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm.so fbgemm_gpu_embedding_inplace_ops.so fbgemm_gpu_tbe_index_select.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_optimizers.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm_gpu_sparse_async_cumsum.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:02:51.5516631Z [599/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 1 2025-05-07T20:02:51.5517900Z ################################################################################ 2025-05-07T20:02:51.5518481Z [CMAKE] Running post-build script ... 2025-05-07T20:02:51.5519031Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:02:51.5519567Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:51.5519957Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:51.5520383Z ################################################################################ 2025-05-07T20:02:52.0116052Z [600/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward.so -o fbgemm_gpu_tbe_training_backward.so CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/codegen/training/backward/embedding_backward_dense_host_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_lars_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_adam_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_partial_rowwise_lamb_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_none_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_sgd_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_counter_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_approx_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_adagrad_with_weight_decay_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_rowwise_weighted_adagrad_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_cpu.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_grad_embedding_ops.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_split_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_dense_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_ssd_indice_weights_codegen_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_lars_sgd_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_adam_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_partial_rowwise_lamb_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_none_split_unweighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_cache.so fbgemm_gpu_tbe_common.so fbgemm_gpu_sparse_async_cumsum.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed fbgemm.so fbgemm_gpu_config.so fbgemm_gpu_tbe_utils.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:02:52.6461839Z [601/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_gwd.so -o fbgemm_gpu_tbe_training_backward_gwd.so CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_gwd_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_gwd.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_gwd_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -lrt -lpthread -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build -L"/github/home/miniconda/envs/build_binary/lib/stubs" -L"/github/home/miniconda/envs/build_binary/lib" && : 2025-05-07T20:02:52.6863548Z [602/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 1 2025-05-07T20:02:52.6864921Z ################################################################################ 2025-05-07T20:02:52.6865289Z [CMAKE] Running post-build script ... 2025-05-07T20:02:52.6866100Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:02:52.6866730Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:52.6867110Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:52.6867517Z ################################################################################ 2025-05-07T20:02:52.8359317Z [603/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_vbe.so -o fbgemm_gpu_tbe_training_backward_vbe.so CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_ssd_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_rowwise_adagrad_with_counter_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_sgd_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_dense_split_unweighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_weighted_vbe_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_vbe.dir/gen_embedding_backward_adam_split_unweighted_vbe_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:02:53.9572564Z [604/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 1 2025-05-07T20:02:53.9576447Z ################################################################################ 2025-05-07T20:02:53.9577503Z [CMAKE] Running post-build script ... 2025-05-07T20:02:53.9578561Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:02:53.9579176Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:53.9579553Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:53.9579957Z ################################################################################ 2025-05-07T20:02:56.2773062Z [605/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 1 2025-05-07T20:02:56.2774443Z ################################################################################ 2025-05-07T20:02:56.2774860Z [CMAKE] Running post-build script ... 2025-05-07T20:02:56.2775497Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:02:56.2776174Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:56.2776579Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:56.2777004Z ################################################################################ 2025-05-07T20:02:57.5994142Z [606/608] : && /github/home/miniconda/envs/build_binary/bin/c++ -fPIC -DTORCH_USE_CUDA_DSA -DTORCH_USE_HIP_DSA -L/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib -fopenmp=libgomp -stdlib=libstdc++ -I/github/home/miniconda/envs/build_binary/include -O3 -DNDEBUG -Wl,-O2 -Wl,--sort-common -Wl,--as-needed -Wl,-z,relro -Wl,-z,now -Wl,--disable-new-dtags -Wl,--gc-sections -Wl,--allow-shlib-undefined -Wl,-rpath,/github/home/miniconda/envs/build_binary/lib -Wl,-rpath-link,/github/home/miniconda/envs/build_binary/lib -L/github/home/miniconda/envs/build_binary/lib -s -shared -Wl,-soname,fbgemm_gpu_tbe_training_backward_dense.so -o fbgemm_gpu_tbe_training_backward_dense.so CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_meta.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_split_dense.cpp.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_weighted_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_nobag_kernel_warp.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_cuda.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_cta.cu.o CMakeFiles/fbgemm_gpu_tbe_training_backward_dense.dir/gen_embedding_backward_dense_split_unweighted_kernel_warp.cu.o -L/lib/intel64 -L/lib/intel64_win -L/lib/win-x64 -L/github/home/miniconda/envs/build_binary/lib/stubs -Wl,-rpath,"\$ORIGIN" /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libnvrtc.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/nvidia/nccl/lib/libnccl.so.2 /github/home/miniconda/envs/build_binary/lib/stubs/libcuda.so fbgemm_gpu_tbe_training_backward.so /github/home/miniconda/envs/build_binary/lib/stubs/libnvidia-ml.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch.so -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so" -Wl,--as-needed -Wl,--no-as-needed,"/github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libtorch_cuda.so" -Wl,--as-needed /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10_cuda.so /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/torch/lib/libc10.so /github/home/miniconda/envs/build_binary/lib/libcudart.so -lcudadevrt -lcudart_static -ldl -Wl,-rpath-link,/__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && : 2025-05-07T20:02:57.8314965Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/../.github/scripts/fbgemm_gpu_postbuild.bash /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 1 2025-05-07T20:02:57.8316351Z ################################################################################ 2025-05-07T20:02:57.8316719Z [CMAKE] Running post-build script ... 2025-05-07T20:02:57.8317395Z Target file: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:02:57.8318047Z Resetting RPATH to $ORIGIN ... 2025-05-07T20:02:57.8318446Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:02:57.8318886Z ################################################################################ 2025-05-07T20:02:57.8320126Z [607/608] cd /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-build && /github/home/miniconda/envs/build_binary/lib/python3.10/site-packages/cmake/data/bin/cmake -P cmake_install.cmake 2025-05-07T20:02:57.8358335Z -- Install configuration: "Release" 2025-05-07T20:02:57.8363002Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/asmjit.so 2025-05-07T20:02:57.8618517Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm.so 2025-05-07T20:02:57.8619486Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so 2025-05-07T20:02:57.8637940Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so 2025-05-07T20:02:57.8638951Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so 2025-05-07T20:02:57.8661338Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so 2025-05-07T20:02:57.8677522Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:02:57.8678565Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so 2025-05-07T20:02:57.8679720Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:02:57.8707383Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:02:57.8708899Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:02:57.8709982Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:02:57.8711040Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:02:57.8712219Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:02:57.8713468Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:02:57.8714609Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:02:57.8715699Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:02:57.8717278Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py 2025-05-07T20:02:57.8718665Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py 2025-05-07T20:02:57.8719956Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py 2025-05-07T20:02:57.8721158Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py 2025-05-07T20:02:57.8722391Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py 2025-05-07T20:02:57.8723623Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py 2025-05-07T20:02:57.8724913Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py 2025-05-07T20:02:57.8726274Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py 2025-05-07T20:02:57.8727768Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py 2025-05-07T20:02:57.8729077Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py 2025-05-07T20:02:57.8730467Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py 2025-05-07T20:02:57.8731960Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py 2025-05-07T20:02:57.8733237Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py 2025-05-07T20:02:57.8734483Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py 2025-05-07T20:02:57.8735892Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py 2025-05-07T20:02:57.8737212Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py 2025-05-07T20:02:57.8738389Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:02:57.8749400Z -- Installing: /__w/FBGEMM/FBGEMM/fbgemm_gpu/_skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so 2025-05-07T20:02:57.8794051Z 2025-05-07T20:02:57.8838257Z 2025-05-07T20:02:57.8838946Z copying fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/__init__.py 2025-05-07T20:02:57.8840826Z copying fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py 2025-05-07T20:02:57.8842508Z copying fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/enums.py 2025-05-07T20:02:57.8843904Z copying fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/metrics.py 2025-05-07T20:02:57.8845720Z copying fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py 2025-05-07T20:02:57.8848014Z copying fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py 2025-05-07T20:02:57.8850029Z copying fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_comm.py 2025-05-07T20:02:57.8851502Z copying fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_utils.py 2025-05-07T20:02:57.8852975Z copying fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/runtime_monitor.py 2025-05-07T20:02:57.8854415Z copying fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sparse_ops.py 2025-05-07T20:02:57.8856081Z copying fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_configs.py 2025-05-07T20:02:57.8858198Z copying fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py 2025-05-07T20:02:57.8860437Z copying fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py 2025-05-07T20:02:57.8862374Z copying fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_utils.py 2025-05-07T20:02:57.8864511Z copying fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py 2025-05-07T20:02:57.8866689Z copying fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py 2025-05-07T20:02:57.8869131Z copying fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py 2025-05-07T20:02:57.8871889Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py 2025-05-07T20:02:57.8874616Z copying fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py 2025-05-07T20:02:57.8877066Z copying fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py 2025-05-07T20:02:57.8879154Z copying fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py 2025-05-07T20:02:57.8880778Z copying fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/uvm.py 2025-05-07T20:02:57.8881940Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config 2025-05-07T20:02:57.8883285Z copying fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/__init__.py 2025-05-07T20:02:57.8884890Z copying fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/feature_list.py 2025-05-07T20:02:57.8886311Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs 2025-05-07T20:02:57.8887574Z copying fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/__init__.py 2025-05-07T20:02:57.8889081Z copying fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/common.py 2025-05-07T20:02:57.8890506Z copying fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/examples.py 2025-05-07T20:02:57.8892123Z copying fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py 2025-05-07T20:02:57.8893983Z copying fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py 2025-05-07T20:02:57.8896035Z copying fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py 2025-05-07T20:02:57.8897944Z copying fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/quantize_ops.py 2025-05-07T20:02:57.8899554Z copying fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/sparse_ops.py 2025-05-07T20:02:57.8901139Z copying fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/version.py 2025-05-07T20:02:57.8902393Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize 2025-05-07T20:02:57.8903777Z copying fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/__init__.py 2025-05-07T20:02:57.8905482Z copying fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/quantize_ops.py 2025-05-07T20:02:57.8906890Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll 2025-05-07T20:02:57.8908116Z copying fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/__init__.py 2025-05-07T20:02:57.8909345Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe 2025-05-07T20:02:57.8910556Z copying fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/__init__.py 2025-05-07T20:02:57.8911916Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton 2025-05-07T20:02:57.8913294Z copying fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/__init__.py 2025-05-07T20:02:57.8914752Z copying fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/common.py 2025-05-07T20:02:57.8916333Z copying fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize.py 2025-05-07T20:02:57.8917975Z copying fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize_ref.py 2025-05-07T20:02:57.8919509Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils 2025-05-07T20:02:57.8920777Z copying fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/__init__.py 2025-05-07T20:02:57.8922254Z copying fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/filestore.py 2025-05-07T20:02:57.8923799Z copying fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/loader.py 2025-05-07T20:02:57.8925332Z copying fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/torch_library.py 2025-05-07T20:02:57.8926811Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu 2025-05-07T20:02:57.8928155Z copying fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/__init__.py 2025-05-07T20:02:57.8929725Z copying fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py 2025-05-07T20:02:57.8931056Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta 2025-05-07T20:02:57.8932660Z copying fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/__init__.py 2025-05-07T20:02:57.8934337Z copying fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py 2025-05-07T20:02:57.8935781Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.8937145Z copying fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/__init__.py 2025-05-07T20:02:57.8938819Z copying fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/common.py 2025-05-07T20:02:57.8940892Z copying fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py 2025-05-07T20:02:57.8943302Z copying fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py 2025-05-07T20:02:57.8945532Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py 2025-05-07T20:02:57.8947620Z copying fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py 2025-05-07T20:02:57.8950030Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py 2025-05-07T20:02:57.8952874Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py 2025-05-07T20:02:57.8955553Z copying fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py 2025-05-07T20:02:57.8958075Z copying fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py 2025-05-07T20:02:57.8960726Z copying fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py 2025-05-07T20:02:57.8963168Z copying fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py 2025-05-07T20:02:57.8965610Z copying fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py 2025-05-07T20:02:57.8967567Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench 2025-05-07T20:02:57.8969034Z copying fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/__init__.py 2025-05-07T20:02:57.8970668Z copying fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py 2025-05-07T20:02:57.8972560Z copying fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py 2025-05-07T20:02:57.8974149Z copying fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py 2025-05-07T20:02:57.8976198Z copying fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py 2025-05-07T20:02:57.8978335Z copying fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py 2025-05-07T20:02:57.8980214Z copying fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/reporter.py 2025-05-07T20:02:57.8981669Z copying fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py 2025-05-07T20:02:57.8983564Z copying fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py 2025-05-07T20:02:57.8985518Z copying fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py 2025-05-07T20:02:57.8987245Z copying fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/utils.py 2025-05-07T20:02:57.8988541Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache 2025-05-07T20:02:57.8990017Z copying fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/__init__.py 2025-05-07T20:02:57.8991958Z copying fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py 2025-05-07T20:02:57.8993746Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:57.8995117Z copying fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py 2025-05-07T20:02:57.8996670Z copying fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/common.py 2025-05-07T20:02:57.8998302Z copying fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/inference.py 2025-05-07T20:02:57.8999957Z copying fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/training.py 2025-05-07T20:02:57.9001345Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats 2025-05-07T20:02:57.9002752Z copying fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/__init__.py 2025-05-07T20:02:57.9004631Z copying fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py 2025-05-07T20:02:57.9006281Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils 2025-05-07T20:02:57.9008152Z copying fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/__init__.py 2025-05-07T20:02:57.9009774Z copying fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/common.py 2025-05-07T20:02:57.9011432Z copying fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/offsets.py 2025-05-07T20:02:57.9013138Z copying fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/quantize.py 2025-05-07T20:02:57.9014865Z copying fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/requests.py 2025-05-07T20:02:57.9016467Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:57.9017899Z copying fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py 2025-05-07T20:02:57.9020096Z copying fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py 2025-05-07T20:02:57.9022078Z creating directory _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged 2025-05-07T20:02:57.9023575Z copying fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/__init__.py 2025-05-07T20:02:57.9025481Z copying fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py 2025-05-07T20:02:57.9026634Z 2025-05-07T20:02:57.9088265Z INFO:root:running bdist_wheel 2025-05-07T20:02:57.9119831Z INFO:root:running build 2025-05-07T20:02:57.9120323Z INFO:root:running build_py 2025-05-07T20:02:57.9127608Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9130154Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9133281Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9135713Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9138000Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9140502Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9143268Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9145846Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9148215Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9150639Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9153114Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9155697Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9158336Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9161040Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9163643Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9166404Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9169080Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9172099Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9174844Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9177823Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9180782Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9183627Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9186027Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9187895Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:02:57.9189899Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:02:57.9192721Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:02:57.9194691Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:02:57.9196589Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:02:57.9199026Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:02:57.9201513Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:02:57.9204048Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:02:57.9206825Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:02:57.9209623Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:02:57.9212237Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:02:57.9214716Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:02:57.9217258Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:02:57.9219250Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:02:57.9221270Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:02:57.9223949Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:02:57.9226004Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll 2025-05-07T20:02:57.9227891Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll 2025-05-07T20:02:57.9229753Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe 2025-05-07T20:02:57.9232094Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe 2025-05-07T20:02:57.9235166Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:02:57.9237135Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:02:57.9239628Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:02:57.9242190Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:02:57.9244806Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:02:57.9247293Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:02:57.9249280Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:02:57.9251779Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:02:57.9254294Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:02:57.9256841Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:02:57.9259272Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:02:57.9261248Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:02:57.9263723Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:02:57.9267502Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:02:57.9269599Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:02:57.9272312Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:02:57.9276766Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.9278953Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.9281601Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.9284449Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.9287529Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.9290533Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.9293408Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.9296446Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.9299596Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.9302821Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.9305928Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.9309189Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.9312172Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.9315445Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:02:57.9317767Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:02:57.9319802Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:02:57.9322426Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:02:57.9325223Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:02:57.9327871Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:02:57.9330708Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:02:57.9333760Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:02:57.9336503Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:02:57.9339180Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:02:57.9342159Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:02:57.9345059Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:02:57.9347783Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:02:57.9349823Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:02:57.9351872Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:02:57.9354726Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:02:57.9356853Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:57.9358836Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:57.9361294Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:57.9363868Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:57.9366663Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:02:57.9368719Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:02:57.9370751Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:02:57.9373553Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:02:57.9375921Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:02:57.9377978Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:02:57.9380575Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:02:57.9383283Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:02:57.9385942Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:02:57.9388610Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:02:57.9390825Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:57.9393048Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:57.9395963Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:02:57.9398404Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:02:57.9400644Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:02:57.9403679Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:02:57.9451368Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9491174Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:57.9800577Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:58.0410564Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:59.6741166Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:59.6745950Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:59.7369330Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:59.7432949Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:59.7552614Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:02:59.7905723Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:01.2857274Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:01.3682508Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:05.3064308Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:05.9101702Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:07.3073841Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:07.5476812Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:07.5853472Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:07.7270711Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7272550Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7276715Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7288305Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7295996Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7310840Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7317800Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7323926Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7339036Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7346048Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7357745Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7365073Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7372304Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7378264Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7383938Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:07.7396574Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:07.7398525Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:07.7404013Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:07.7411393Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:07.7451013Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0441477Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0444257Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0446773Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0449119Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0451692Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0454574Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0457233Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0459666Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0462242Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0464576Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0467026Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0469700Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0472642Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0475255Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0477895Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0480601Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0483390Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0486220Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0489108Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0491993Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0494674Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0497141Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu 2025-05-07T20:03:08.0499521Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:03:08.0502082Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config 2025-05-07T20:03:08.0504601Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:03:08.0507152Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:03:08.0509667Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:03:08.0512301Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:03:08.0514991Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:03:08.0517763Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:03:08.0520461Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:03:08.0523098Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:03:08.0525605Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs 2025-05-07T20:03:08.0528137Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:03:08.0532491Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize 2025-05-07T20:03:08.0535034Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll 2025-05-07T20:03:08.0537518Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe 2025-05-07T20:03:08.0540037Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:03:08.0542692Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:03:08.0545350Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:03:08.0548047Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton 2025-05-07T20:03:08.0550806Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:03:08.0553526Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:03:08.0556124Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:03:08.0558747Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils 2025-05-07T20:03:08.0561527Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:03:08.0564174Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu 2025-05-07T20:03:08.0566927Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:03:08.0569581Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta 2025-05-07T20:03:08.0572287Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0575041Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0578160Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0581365Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0584397Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0587404Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0590570Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0593962Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0597283Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0600524Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0603870Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0607031Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0610251Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0613253Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0616101Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0618940Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0621754Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0624648Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0627673Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0630695Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0633724Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0636606Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0639574Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0642488Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0645208Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:03:08.0648051Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache 2025-05-07T20:03:08.0650819Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:03:08.0653479Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:03:08.0656123Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:03:08.0673897Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd 2025-05-07T20:03:08.0676493Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:03:08.0679114Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats 2025-05-07T20:03:08.0682054Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:03:08.0684768Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:03:08.0687602Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:03:08.0690657Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:03:08.0693302Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils 2025-05-07T20:03:08.0695954Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:03:08.0699252Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:03:08.0702156Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:03:08.0704952Z INFO:root:copying _skbuild/linux-x86_64-3.10/cmake-install/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged 2025-05-07T20:03:08.0710411Z INFO:skbuild:copied 90 files 2025-05-07T20:03:08.0710872Z INFO:root:running build_ext 2025-05-07T20:03:08.0711629Z INFO:root:installing to _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:03:08.0712463Z INFO:root:running install 2025-05-07T20:03:08.0763763Z INFO:root:running install_lib 2025-05-07T20:03:08.0765529Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:03:08.0768593Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu 2025-05-07T20:03:08.0770119Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/config 2025-05-07T20:03:08.0772316Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:03:08.0775193Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/config/feature_list.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/config 2025-05-07T20:03:08.0777350Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/docs 2025-05-07T20:03:08.0779485Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:08.0782478Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:08.0785386Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/examples.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:08.0788379Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:08.0791569Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/merge_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:08.0794819Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/permute_pooled_embedding_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:08.0797994Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:08.0800904Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:08.0803899Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/docs/version.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/docs 2025-05-07T20:03:08.0806056Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/quantize 2025-05-07T20:03:08.0808479Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:03:08.0811533Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize/quantize_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/quantize 2025-05-07T20:03:08.0813795Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll 2025-05-07T20:03:08.0815254Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/cpu 2025-05-07T20:03:08.0817443Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:03:08.0820511Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/cpu/cpu_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/cpu 2025-05-07T20:03:08.0822706Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/meta 2025-05-07T20:03:08.0824926Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:03:08.0827892Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/meta/meta_sll.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/meta 2025-05-07T20:03:08.0830265Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0832862Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0835848Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0839041Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0842493Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0845965Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_bmm.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0849403Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0852845Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0856420Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0860064Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0863668Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0867242Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0870722Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_jagged_softmax.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0874293Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll/triton 2025-05-07T20:03:08.0877527Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sll/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/sll 2025-05-07T20:03:08.0879577Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe 2025-05-07T20:03:08.0880945Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0883200Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0886338Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/bench_config.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0889480Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/bench_runs.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0892514Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/eeg_cli.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0895727Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/embedding_ops_common_config.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0899084Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/eval_compression.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0902241Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0905512Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/tbe_data_config.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0908731Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/tbe_data_config_loader.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0912002Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0915389Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/bench/utils.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/bench 2025-05-07T20:03:08.0917685Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/cache 2025-05-07T20:03:08.0919842Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:03:08.0922794Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/cache 2025-05-07T20:03:08.0925052Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd 2025-05-07T20:03:08.0926489Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:03:08.0928686Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:03:08.0932035Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd/utils 2025-05-07T20:03:08.0935126Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:03:08.0937905Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:03:08.0940853Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/inference.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:03:08.0943769Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/ssd/training.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/ssd 2025-05-07T20:03:08.0945810Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/stats 2025-05-07T20:03:08.0947971Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:03:08.0950837Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/stats/bench_params_reporter.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/stats 2025-05-07T20:03:08.0953264Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/tbe/utils 2025-05-07T20:03:08.0955502Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:03:08.0958321Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:03:08.0961174Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/offsets.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:03:08.0964134Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:03:08.0966946Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/utils/requests.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe/utils 2025-05-07T20:03:08.0969426Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/tbe 2025-05-07T20:03:08.0971233Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton 2025-05-07T20:03:08.0972919Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/triton/jagged 2025-05-07T20:03:08.0975145Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:03:08.0978271Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton/jagged 2025-05-07T20:03:08.0981209Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:03:08.0983947Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:03:08.0986847Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/quantize.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:03:08.0989527Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/triton/quantize_ref.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/triton 2025-05-07T20:03:08.0991462Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/utils 2025-05-07T20:03:08.0993672Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:03:08.0996410Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils/filestore.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:03:08.0999311Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils/loader.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:03:08.1002071Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/utils/torch_library.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/utils 2025-05-07T20:03:08.1004806Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/asmjit.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.1007241Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.1009666Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_cache.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.1054900Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_inference.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.2396604Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_config.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.2399437Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_utils.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.2453246Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.2457999Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_common.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.2475150Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.2506191Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.3666180Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.3733253Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.6764597Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.7241213Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8316941Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8505152Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8535166Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_tbe_index_select.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8649554Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8652456Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8656574Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8660843Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8664901Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8668954Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8672930Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8677123Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8681287Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8685385Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8689720Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8694023Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8698293Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8702216Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8706270Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_codegen_lookup_invokers 2025-05-07T20:03:08.8709200Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:08.8712182Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:08.8716534Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu/split_embedding_optimizer_codegen 2025-05-07T20:03:08.8719973Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8722807Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/fbgemm_gpu_py.so -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8938914Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/__init__.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8941865Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/batched_unary_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8944735Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/enums.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8947372Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/metrics.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8950140Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/permute_pooled_embedding_modules.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8953394Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/permute_pooled_embedding_modules_split.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8956565Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize_comm.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8959390Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/quantize_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8962146Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/runtime_monitor.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8964900Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/sparse_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8967763Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_configs.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8970821Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_inference_converter.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8973460Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_optimizer_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8976263Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_embedding_utils.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8979305Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8982647Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops_common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8985968Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops_inference.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8989168Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops_training.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8992461Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8995887Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.8998955Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/tbe_input_multiplexer.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.9001687Z INFO:root:copying _skbuild/linux-x86_64-3.10/setuptools/lib.linux-x86_64-cpython-310/fbgemm_gpu/uvm.py -> _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu 2025-05-07T20:03:08.9003314Z INFO:skbuild:copied 125 files 2025-05-07T20:03:08.9003782Z INFO:root:running install_egg_info 2025-05-07T20:03:08.9019745Z INFO:root:running egg_info 2025-05-07T20:03:08.9046866Z INFO:root:writing fbgemm_gpu_nightly.egg-info/PKG-INFO 2025-05-07T20:03:08.9049417Z INFO:root:writing dependency_links to fbgemm_gpu_nightly.egg-info/dependency_links.txt 2025-05-07T20:03:08.9052822Z INFO:root:writing requirements to fbgemm_gpu_nightly.egg-info/requires.txt 2025-05-07T20:03:08.9054171Z INFO:root:writing top-level names to fbgemm_gpu_nightly.egg-info/top_level.txt 2025-05-07T20:03:08.9152089Z INFO:root:reading manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:03:08.9192581Z INFO:root:writing manifest file 'fbgemm_gpu_nightly.egg-info/SOURCES.txt' 2025-05-07T20:03:08.9194214Z INFO:root:Copying fbgemm_gpu_nightly.egg-info to _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/./fbgemm_gpu_nightly-2025.5.7-py3.10.egg-info 2025-05-07T20:03:08.9200294Z INFO:root:running install_scripts 2025-05-07T20:03:08.9200802Z INFO:skbuild:copied 0 files 2025-05-07T20:03:11.6316875Z INFO:root:creating _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel/fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL 2025-05-07T20:03:11.6320249Z INFO:wheel:creating '/__w/FBGEMM/FBGEMM/fbgemm_gpu/dist/.tmp-2e_4cqdf/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl' and adding '_skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel' to it 2025-05-07T20:03:11.6323436Z INFO:wheel:adding 'fbgemm_gpu/__init__.py' 2025-05-07T20:03:11.6585432Z INFO:wheel:adding 'fbgemm_gpu/asmjit.so' 2025-05-07T20:03:11.6595315Z INFO:wheel:adding 'fbgemm_gpu/batched_unary_embeddings_ops.py' 2025-05-07T20:03:11.6596918Z INFO:wheel:adding 'fbgemm_gpu/enums.py' 2025-05-07T20:03:11.8606171Z INFO:wheel:adding 'fbgemm_gpu/fbgemm.so' 2025-05-07T20:03:11.8737180Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_config.so' 2025-05-07T20:03:11.8843286Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_embedding_inplace_ops.so' 2025-05-07T20:03:12.8449999Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_py.so' 2025-05-07T20:03:12.9546637Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_sparse_async_cumsum.so' 2025-05-07T20:03:13.3161974Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_cache.so' 2025-05-07T20:03:13.3759897Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_common.so' 2025-05-07T20:03:13.6882267Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_index_select.so' 2025-05-07T20:03:22.1376489Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_inference.so' 2025-05-07T20:03:22.7399872Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_optimizers.so' 2025-05-07T20:03:37.0446601Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward.so' 2025-05-07T20:03:38.5967634Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_dense.so' 2025-05-07T20:03:40.5536629Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_gwd.so' 2025-05-07T20:03:41.1110558Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_pt2.so' 2025-05-07T20:03:41.3342504Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_split_host.so' 2025-05-07T20:03:45.8993690Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_backward_vbe.so' 2025-05-07T20:03:51.7980833Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_training_forward.so' 2025-05-07T20:03:52.5523836Z INFO:wheel:adding 'fbgemm_gpu/fbgemm_gpu_tbe_utils.so' 2025-05-07T20:03:52.5699668Z INFO:wheel:adding 'fbgemm_gpu/metrics.py' 2025-05-07T20:03:52.5700132Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules.py' 2025-05-07T20:03:52.5701221Z INFO:wheel:adding 'fbgemm_gpu/permute_pooled_embedding_modules_split.py' 2025-05-07T20:03:52.5704842Z INFO:wheel:adding 'fbgemm_gpu/quantize_comm.py' 2025-05-07T20:03:52.5707915Z INFO:wheel:adding 'fbgemm_gpu/quantize_utils.py' 2025-05-07T20:03:52.5711083Z INFO:wheel:adding 'fbgemm_gpu/runtime_monitor.py' 2025-05-07T20:03:52.5722147Z INFO:wheel:adding 'fbgemm_gpu/sparse_ops.py' 2025-05-07T20:03:52.5725929Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_configs.py' 2025-05-07T20:03:52.5728873Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_inference_converter.py' 2025-05-07T20:03:52.5730631Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_ops.py' 2025-05-07T20:03:52.5732611Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_utils.py' 2025-05-07T20:03:52.5734893Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops.py' 2025-05-07T20:03:52.5738175Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_common.py' 2025-05-07T20:03:52.5759471Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_inference.py' 2025-05-07T20:03:52.5800534Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training.py' 2025-05-07T20:03:52.5806008Z INFO:wheel:adding 'fbgemm_gpu/split_table_batched_embeddings_ops_training_common.py' 2025-05-07T20:03:52.5807735Z INFO:wheel:adding 'fbgemm_gpu/ssd_split_table_batched_embeddings_ops.py' 2025-05-07T20:03:52.5809797Z INFO:wheel:adding 'fbgemm_gpu/tbe_input_multiplexer.py' 2025-05-07T20:03:52.5811474Z INFO:wheel:adding 'fbgemm_gpu/uvm.py' 2025-05-07T20:03:52.5813349Z INFO:wheel:adding 'fbgemm_gpu/config/__init__.py' 2025-05-07T20:03:52.5815308Z INFO:wheel:adding 'fbgemm_gpu/config/feature_list.py' 2025-05-07T20:03:52.5817274Z INFO:wheel:adding 'fbgemm_gpu/docs/__init__.py' 2025-05-07T20:03:52.5818721Z INFO:wheel:adding 'fbgemm_gpu/docs/common.py' 2025-05-07T20:03:52.5820689Z INFO:wheel:adding 'fbgemm_gpu/docs/examples.py' 2025-05-07T20:03:52.5823303Z INFO:wheel:adding 'fbgemm_gpu/docs/jagged_tensor_ops.py' 2025-05-07T20:03:52.5825268Z INFO:wheel:adding 'fbgemm_gpu/docs/merge_pooled_embedding_ops.py' 2025-05-07T20:03:52.5827513Z INFO:wheel:adding 'fbgemm_gpu/docs/permute_pooled_embedding_ops.py' 2025-05-07T20:03:52.5829307Z INFO:wheel:adding 'fbgemm_gpu/docs/quantize_ops.py' 2025-05-07T20:03:52.5835843Z INFO:wheel:adding 'fbgemm_gpu/docs/sparse_ops.py' 2025-05-07T20:03:52.5837807Z INFO:wheel:adding 'fbgemm_gpu/docs/version.py' 2025-05-07T20:03:52.5839737Z INFO:wheel:adding 'fbgemm_gpu/quantize/__init__.py' 2025-05-07T20:03:52.5841706Z INFO:wheel:adding 'fbgemm_gpu/quantize/quantize_ops.py' 2025-05-07T20:03:52.5843977Z INFO:wheel:adding 'fbgemm_gpu/sll/__init__.py' 2025-05-07T20:03:52.5846109Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/__init__.py' 2025-05-07T20:03:52.5852531Z INFO:wheel:adding 'fbgemm_gpu/sll/cpu/cpu_sll.py' 2025-05-07T20:03:52.5855217Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/__init__.py' 2025-05-07T20:03:52.5857820Z INFO:wheel:adding 'fbgemm_gpu/sll/meta/meta_sll.py' 2025-05-07T20:03:52.5860445Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/__init__.py' 2025-05-07T20:03:52.5862184Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/common.py' 2025-05-07T20:03:52.5864129Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_dense_jagged_cat_jagged_out.py' 2025-05-07T20:03:52.5866659Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged2_to_padded_dense.py' 2025-05-07T20:03:52.5870363Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm.py' 2025-05-07T20:03:52.5874705Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_bmm_jagged_out.py' 2025-05-07T20:03:52.5876937Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_add.py' 2025-05-07T20:03:52.5879323Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_elementwise_mul_jagged_out.py' 2025-05-07T20:03:52.5884939Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_dense_flash_attention.py' 2025-05-07T20:03:52.5890411Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_flash_attention_basic.py' 2025-05-07T20:03:52.5892727Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_self_substraction_jagged_out.py' 2025-05-07T20:03:52.5896697Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_jagged_softmax.py' 2025-05-07T20:03:52.5902147Z INFO:wheel:adding 'fbgemm_gpu/sll/triton/triton_multi_head_jagged_flash_attention.py' 2025-05-07T20:03:52.5905038Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/__init__.py' 2025-05-07T20:03:52.5908089Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adagrad.py' 2025-05-07T20:03:52.5911893Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_adam.py' 2025-05-07T20:03:52.5914367Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args.py' 2025-05-07T20:03:52.5916520Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_args_ssd.py' 2025-05-07T20:03:52.5919553Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lamb.py' 2025-05-07T20:03:52.5922873Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_lars_sgd.py' 2025-05-07T20:03:52.5925895Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_none.py' 2025-05-07T20:03:52.5929209Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_adam.py' 2025-05-07T20:03:52.5932678Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_partial_rowwise_lamb.py' 2025-05-07T20:03:52.5936052Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad.py' 2025-05-07T20:03:52.5939280Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_ssd.py' 2025-05-07T20:03:52.5942833Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_rowwise_adagrad_with_counter.py' 2025-05-07T20:03:52.5945787Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_codegen_lookup_invokers/lookup_sgd.py' 2025-05-07T20:03:52.5947985Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/optimizer_args.py' 2025-05-07T20:03:52.5950701Z INFO:wheel:adding 'fbgemm_gpu/split_embedding_optimizer_codegen/split_embedding_optimizer_rowwise_adagrad.py' 2025-05-07T20:03:52.5952314Z INFO:wheel:adding 'fbgemm_gpu/tbe/__init__.py' 2025-05-07T20:03:52.5954573Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/__init__.py' 2025-05-07T20:03:52.5956805Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_config.py' 2025-05-07T20:03:52.5961788Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/bench_runs.py' 2025-05-07T20:03:52.5964547Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eeg_cli.py' 2025-05-07T20:03:52.5967027Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/embedding_ops_common_config.py' 2025-05-07T20:03:52.5969082Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/eval_compression.py' 2025-05-07T20:03:52.5970747Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/reporter.py' 2025-05-07T20:03:52.5974035Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config.py' 2025-05-07T20:03:52.5976844Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_loader.py' 2025-05-07T20:03:52.5979483Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/tbe_data_config_param_models.py' 2025-05-07T20:03:52.5981284Z INFO:wheel:adding 'fbgemm_gpu/tbe/bench/utils.py' 2025-05-07T20:03:52.5983089Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/__init__.py' 2025-05-07T20:03:52.5984856Z INFO:wheel:adding 'fbgemm_gpu/tbe/cache/split_embeddings_cache_ops.py' 2025-05-07T20:03:52.5986537Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/__init__.py' 2025-05-07T20:03:52.5988077Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/common.py' 2025-05-07T20:03:52.5994241Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/inference.py' 2025-05-07T20:03:52.6020742Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/training.py' 2025-05-07T20:03:52.6023588Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/__init__.py' 2025-05-07T20:03:52.6026458Z INFO:wheel:adding 'fbgemm_gpu/tbe/ssd/utils/partially_materialized_tensor.py' 2025-05-07T20:03:52.6028285Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/__init__.py' 2025-05-07T20:03:52.6031156Z INFO:wheel:adding 'fbgemm_gpu/tbe/stats/bench_params_reporter.py' 2025-05-07T20:03:52.6033914Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/__init__.py' 2025-05-07T20:03:52.6035378Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/common.py' 2025-05-07T20:03:52.6037240Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/offsets.py' 2025-05-07T20:03:52.6039901Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/quantize.py' 2025-05-07T20:03:52.6045612Z INFO:wheel:adding 'fbgemm_gpu/tbe/utils/requests.py' 2025-05-07T20:03:52.6047949Z INFO:wheel:adding 'fbgemm_gpu/triton/__init__.py' 2025-05-07T20:03:52.6049844Z INFO:wheel:adding 'fbgemm_gpu/triton/common.py' 2025-05-07T20:03:52.6057459Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize.py' 2025-05-07T20:03:52.6062232Z INFO:wheel:adding 'fbgemm_gpu/triton/quantize_ref.py' 2025-05-07T20:03:52.6064348Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/__init__.py' 2025-05-07T20:03:52.6072334Z INFO:wheel:adding 'fbgemm_gpu/triton/jagged/triton_jagged_tensor_ops.py' 2025-05-07T20:03:52.6075043Z INFO:wheel:adding 'fbgemm_gpu/utils/__init__.py' 2025-05-07T20:03:52.6077402Z INFO:wheel:adding 'fbgemm_gpu/utils/filestore.py' 2025-05-07T20:03:52.6079140Z INFO:wheel:adding 'fbgemm_gpu/utils/loader.py' 2025-05-07T20:03:52.6081453Z INFO:wheel:adding 'fbgemm_gpu/utils/torch_library.py' 2025-05-07T20:03:52.6084418Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/METADATA' 2025-05-07T20:03:52.6085577Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/WHEEL' 2025-05-07T20:03:52.6086481Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/top_level.txt' 2025-05-07T20:03:52.6093012Z INFO:wheel:adding 'fbgemm_gpu_nightly-2025.5.7.dist-info/RECORD' 2025-05-07T20:03:52.6096968Z INFO:root:removing _skbuild/linux-x86_64-3.10/setuptools/bdist.linux-x86_64/wheel 2025-05-07T20:03:52.6919224Z ╒════════════════════════════╤════════════════════════════════════════════════╕ 2025-05-07T20:03:52.6919774Z │ │ Version │ 2025-05-07T20:03:52.6920304Z ╞════════════════════════════╪════════════════════════════════════════════════╡ 2025-05-07T20:03:52.6920973Z │ PyTorch │ 2.8.0.dev20250507+cu118 │ 2025-05-07T20:03:52.6921519Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:03:52.6922075Z │ CUDA (Declared by PyTorch) │ 11.8 │ 2025-05-07T20:03:52.6922665Z ├────────────────────────────┼────────────────────────────────────────────────┤ 2025-05-07T20:03:52.6923194Z │ CUDA (Actual) │ nvcc: NVIDIA (R) Cuda compiler driver │ 2025-05-07T20:03:52.6923748Z │ │ Copyright (c) 2005-2022 NVIDIA Corporation │ 2025-05-07T20:03:52.6924297Z │ │ Built on Wed_Sep_21_10:33:58_PDT_2022 │ 2025-05-07T20:03:52.6924788Z │ │ Cuda compilation tools, release 11.8, V11.8.89 │ 2025-05-07T20:03:52.6925295Z │ │ Build cuda_11.8.r11.8/compiler.31833905_0 │ 2025-05-07T20:03:52.6925855Z ╘════════════════════════════╧════════════════════════════════════════════════╛ 2025-05-07T20:03:52.9435065Z Successfully built fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:03:53.0317117Z 2025-05-07T20:03:53.0470113Z ################################################################################ 2025-05-07T20:03:53.0471516Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:03:53.0473093Z [CHECK] Listing out library size: 2025-05-07T20:03:53.0474261Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:03:53.0475221Z 2025-05-07T20:03:53.0480234Z 1 ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:03:53.0482054Z 2025-05-07T20:03:53.0483109Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:03:53.0484052Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.0484601Z 2025-05-07T20:03:53.0549898Z GLIBC_2.2.5 2025-05-07T20:03:53.0550505Z GLIBC_2.14 2025-05-07T20:03:53.0550863Z 2025-05-07T20:03:53.0550894Z 2025-05-07T20:03:53.0551927Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:03:53.0553650Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.0554224Z 2025-05-07T20:03:53.0605360Z GLIBCXX_3.4 2025-05-07T20:03:53.0605806Z 2025-05-07T20:03:53.0605820Z 2025-05-07T20:03:53.0628849Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so > /tmp/tmp.BTQaF6zI0k.symbols.txt 2025-05-07T20:03:53.0630120Z 2025-05-07T20:03:53.0661958Z 2025-05-07T20:03:53.0685556Z [CHECK] Total Number of symbols: 841 2025-05-07T20:03:53.0702664Z [CHECK] Number of fbgemm symbols: 0 2025-05-07T20:03:53.0716935Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so > /tmp/tmp.sofNhWWejR.usymbols.txt 2025-05-07T20:03:53.0718199Z 2025-05-07T20:03:53.0733517Z 2025-05-07T20:03:53.0757974Z [CHECK] Listing out undefined symbols (51 total): 2025-05-07T20:03:53.0775655Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:53.0776643Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:53.0777596Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:53.0778866Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:53.0779803Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:03:53.0780731Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:53.0781784Z U abort@GLIBC_2.2.5 2025-05-07T20:03:53.0782606Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:53.0783036Z U close@GLIBC_2.2.5 2025-05-07T20:03:53.0783331Z U fputs@GLIBC_2.2.5 2025-05-07T20:03:53.0783612Z U free@GLIBC_2.2.5 2025-05-07T20:03:53.0783979Z U ftruncate64@GLIBC_2.2.5 2025-05-07T20:03:53.0784284Z U fwrite@GLIBC_2.2.5 2025-05-07T20:03:53.0784586Z U getenv@GLIBC_2.2.5 2025-05-07T20:03:53.0784884Z U getpagesize@GLIBC_2.2.5 2025-05-07T20:03:53.0785204Z U madvise@GLIBC_2.2.5 2025-05-07T20:03:53.0785494Z U malloc@GLIBC_2.2.5 2025-05-07T20:03:53.0785799Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:53.0786199Z U memcpy@GLIBC_2.14 2025-05-07T20:03:53.0786504Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:53.0786799Z U memset@GLIBC_2.2.5 2025-05-07T20:03:53.0787079Z U mmap@GLIBC_2.2.5 2025-05-07T20:03:53.0787379Z U mprotect@GLIBC_2.2.5 2025-05-07T20:03:53.0787672Z U munmap@GLIBC_2.2.5 2025-05-07T20:03:53.0787972Z U open64@GLIBC_2.2.5 2025-05-07T20:03:53.0788395Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:53.0788719Z U pthread_mutex_destroy@GLIBC_2.2.5 2025-05-07T20:03:53.0789045Z U pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:53.0789353Z U pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:53.0789664Z U read@GLIBC_2.2.5 2025-05-07T20:03:53.0789925Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:53.0790208Z U shm_open@GLIBC_2.2.5 2025-05-07T20:03:53.0790485Z U shm_unlink@GLIBC_2.2.5 2025-05-07T20:03:53.0790777Z U snprintf@GLIBC_2.2.5 2025-05-07T20:03:53.0791100Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:53.0791390Z U stderr@GLIBC_2.2.5 2025-05-07T20:03:53.0791671Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:53.0791942Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:53.0792222Z U strtol@GLIBC_2.2.5 2025-05-07T20:03:53.0792489Z U syscall@GLIBC_2.2.5 2025-05-07T20:03:53.0792897Z U sysconf@GLIBC_2.2.5 2025-05-07T20:03:53.0793352Z U uname@GLIBC_2.2.5 2025-05-07T20:03:53.0793655Z U unlink@GLIBC_2.2.5 2025-05-07T20:03:53.0793960Z U vsnprintf@GLIBC_2.2.5 2025-05-07T20:03:53.0794322Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:53.0794767Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:53.0795207Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:03:53.0795614Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:53.0795940Z w _ITM_registerTMCloneTable 2025-05-07T20:03:53.0796272Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:53.0796676Z w __gmon_start__ 2025-05-07T20:03:53.0797009Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:53.0797433Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:03:53.0797692Z 2025-05-07T20:03:53.0819303Z linux-vdso.so.1 (0x00007ffd0a14a000) 2025-05-07T20:03:53.0819834Z libtorch_cpu.so => not found 2025-05-07T20:03:53.0820121Z libtorch_cuda.so => not found 2025-05-07T20:03:53.0820407Z libtorch.so => not found 2025-05-07T20:03:53.0820755Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fe5682fd000) 2025-05-07T20:03:53.0821181Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fe5682a7000) 2025-05-07T20:03:53.0823199Z librt.so.1 => /lib64/librt.so.1 (0x00007fe5682a0000) 2025-05-07T20:03:53.0823597Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fe568272000) 2025-05-07T20:03:53.0824049Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fe56826d000) 2025-05-07T20:03:53.0824507Z libc.so.6 => /lib64/libc.so.6 (0x00007fe568065000) 2025-05-07T20:03:53.0824887Z libm.so.6 => /lib64/libm.so.6 (0x00007fe567f8a000) 2025-05-07T20:03:53.0825366Z /lib64/ld-linux-x86-64.so.2 (0x00007fe5685dd000) 2025-05-07T20:03:53.0825628Z 2025-05-07T20:03:53.0825845Z [CHECK] Displaying ELF information: 2025-05-07T20:03:53.0826246Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so 2025-05-07T20:03:53.0826517Z 2025-05-07T20:03:53.0855243Z 2025-05-07T20:03:53.0855899Z Dynamic section at offset 0x74dd0 contains 35 entries: 2025-05-07T20:03:53.0857145Z Tag Type Name/Value 2025-05-07T20:03:53.0858436Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:53.0860028Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:53.0861176Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:53.0861680Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:53.0862210Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:53.0862720Z 0x0000000000000001 (NEEDED) Shared library: [librt.so.1] 2025-05-07T20:03:53.0863212Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:53.0863736Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:03:53.0864244Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:53.0864741Z 0x000000000000000e (SONAME) Library soname: [asmjit.so] 2025-05-07T20:03:53.0865151Z 0x000000000000000c (INIT) 0x19000 2025-05-07T20:03:53.0865499Z 0x000000000000000d (FINI) 0x56a1c 2025-05-07T20:03:53.0865822Z 0x0000000000000019 (INIT_ARRAY) 0x74ff8 2025-05-07T20:03:53.0866171Z 0x000000000000001b (INIT_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:53.0866521Z 0x000000000000001a (FINI_ARRAY) 0x75000 2025-05-07T20:03:53.0866856Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:53.0867207Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:53.0867534Z 0x0000000000000005 (STRTAB) 0x7120 2025-05-07T20:03:53.0868093Z 0x0000000000000006 (SYMTAB) 0x2230 2025-05-07T20:03:53.0868452Z 0x000000000000000a (STRSZ) 48790 (bytes) 2025-05-07T20:03:53.0868829Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:53.0869190Z 0x0000000000000003 (PLTGOT) 0x76050 2025-05-07T20:03:53.0869548Z 0x0000000000000002 (PLTRELSZ) 8472 (bytes) 2025-05-07T20:03:53.0869905Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:53.0870230Z 0x0000000000000017 (JMPREL) 0x16a58 2025-05-07T20:03:53.0870572Z 0x0000000000000007 (RELA) 0x13710 2025-05-07T20:03:53.0870919Z 0x0000000000000008 (RELASZ) 13128 (bytes) 2025-05-07T20:03:53.0871290Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:53.0871619Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:53.0872096Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:53.0872471Z 0x000000006ffffffe (VERNEED) 0x13650 2025-05-07T20:03:53.0872922Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:53.0873264Z 0x000000006ffffff0 (VERSYM) 0x12fb6 2025-05-07T20:03:53.0873590Z 0x000000006ffffff9 (RELACOUNT) 3 2025-05-07T20:03:53.0873912Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:53.0874120Z 2025-05-07T20:03:53.0874235Z ################################################################################ 2025-05-07T20:03:53.0874481Z 2025-05-07T20:03:53.0874596Z 2025-05-07T20:03:53.0874715Z ################################################################################ 2025-05-07T20:03:53.0875217Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:53.0875696Z [CHECK] Listing out library size: 2025-05-07T20:03:53.0876197Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:53.0876560Z 2025-05-07T20:03:53.0876762Z 1 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:53.0877063Z 2025-05-07T20:03:53.0877469Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:53.0878440Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.0879019Z 2025-05-07T20:03:53.0915752Z GLIBC_2.2.5 2025-05-07T20:03:53.0916395Z GLIBC_2.14 2025-05-07T20:03:53.0916784Z 2025-05-07T20:03:53.0916797Z 2025-05-07T20:03:53.0917997Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:53.0920957Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.0922792Z 2025-05-07T20:03:53.0964741Z GLIBCXX_3.4 2025-05-07T20:03:53.0965364Z GLIBCXX_3.4.9 2025-05-07T20:03:53.0965973Z GLIBCXX_3.4.21 2025-05-07T20:03:53.0966348Z 2025-05-07T20:03:53.0966362Z 2025-05-07T20:03:53.0987671Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.s6fLNOognb.symbols.txt 2025-05-07T20:03:53.0989065Z 2025-05-07T20:03:53.1003621Z 2025-05-07T20:03:53.1031590Z [CHECK] Total Number of symbols: 116 2025-05-07T20:03:53.1049280Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:03:53.1067891Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so > /tmp/tmp.HplopVplcs.usymbols.txt 2025-05-07T20:03:53.1068412Z 2025-05-07T20:03:53.1092170Z 2025-05-07T20:03:53.1123112Z [CHECK] Listing out undefined symbols (59 total): 2025-05-07T20:03:53.1147135Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.1147800Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:53.1148246Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:53.1148579Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:53.1148896Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:53.1149229Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:53.1149552Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:53.1149879Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:53.1150189Z U __errno_location@GLIBC_2.2.5 2025-05-07T20:03:53.1150528Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:53.1150854Z U c10::BoolType::get() 2025-05-07T20:03:53.1151172Z U c10::StringType::get() 2025-05-07T20:03:53.1151520Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:53.1152286Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:53.1153828Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.1154665Z U getenv@GLIBC_2.2.5 2025-05-07T20:03:53.1154957Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:53.1155257Z U memcpy@GLIBC_2.14 2025-05-07T20:03:53.1155539Z U memset@GLIBC_2.2.5 2025-05-07T20:03:53.1155857Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:53.1156209Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:53.1156681Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:53.1157374Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:03:53.1158266Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:53.1159190Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:53.1160116Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.1161182Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.1162258Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:53.1163194Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.1164254Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.1165349Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.1165954Z U std::__throw_invalid_argument(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.1166346Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.1166733Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.1167094Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.1167566Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:53.1168461Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.1169218Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:53.1169565Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:53.1169894Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:53.1170227Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:53.1170552Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:53.1170846Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:53.1171129Z U strtol@GLIBC_2.2.5 2025-05-07T20:03:53.1171422Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:53.1172211Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:53.1173600Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:03:53.1174560Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:53.1175201Z U typeinfo for std::invalid_argument@GLIBCXX_3.4 2025-05-07T20:03:53.1175604Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:53.1176007Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:53.1176458Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:53.1177027Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.1177698Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:53.1178137Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:53.1178443Z w _ITM_registerTMCloneTable 2025-05-07T20:03:53.1178753Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:53.1179060Z w __gmon_start__ 2025-05-07T20:03:53.1179348Z w __pthread_key_create@GLIBC_2.2.5 2025-05-07T20:03:53.1179702Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:53.1180139Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:53.1180423Z 2025-05-07T20:03:53.1197359Z linux-vdso.so.1 (0x00007ffe3e51c000) 2025-05-07T20:03:53.1198260Z libtorch.so => not found 2025-05-07T20:03:53.1198979Z libc10.so => not found 2025-05-07T20:03:53.1199672Z libtorch_cpu.so => not found 2025-05-07T20:03:53.1200462Z libtorch_cuda.so => not found 2025-05-07T20:03:53.1201422Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f089233b000) 2025-05-07T20:03:53.1202656Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f08922e3000) 2025-05-07T20:03:53.1203437Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f08922b5000) 2025-05-07T20:03:53.1203878Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f08922b0000) 2025-05-07T20:03:53.1204306Z libc.so.6 => /lib64/libc.so.6 (0x00007f08920a8000) 2025-05-07T20:03:53.1204662Z libm.so.6 => /lib64/libm.so.6 (0x00007f0891fcd000) 2025-05-07T20:03:53.1205037Z /lib64/ld-linux-x86-64.so.2 (0x00007f08925ae000) 2025-05-07T20:03:53.1205277Z 2025-05-07T20:03:53.1205387Z [CHECK] Displaying ELF information: 2025-05-07T20:03:53.1205817Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so 2025-05-07T20:03:53.1206148Z 2025-05-07T20:03:53.1236352Z 2025-05-07T20:03:53.1236641Z Dynamic section at offset 0x8aa8 contains 35 entries: 2025-05-07T20:03:53.1237041Z Tag Type Name/Value 2025-05-07T20:03:53.1237497Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:53.1238184Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:53.1238708Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:53.1239250Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:53.1239772Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:53.1240304Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:53.1240816Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:53.1241347Z 0x0000000000000001 (NEEDED) Shared library: [libpthread.so.0] 2025-05-07T20:03:53.1241861Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:53.1242405Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_config.so] 2025-05-07T20:03:53.1242863Z 0x000000000000000c (INIT) 0x4000 2025-05-07T20:03:53.1243333Z 0x000000000000000d (FINI) 0x6890 2025-05-07T20:03:53.1243678Z 0x0000000000000019 (INIT_ARRAY) 0x99c0 2025-05-07T20:03:53.1244021Z 0x000000000000001b (INIT_ARRAYSZ) 16 (bytes) 2025-05-07T20:03:53.1244385Z 0x000000000000001a (FINI_ARRAY) 0x99d0 2025-05-07T20:03:53.1244738Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:53.1245087Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:53.1245424Z 0x0000000000000005 (STRTAB) 0xff0 2025-05-07T20:03:53.1245739Z 0x0000000000000006 (SYMTAB) 0x4f8 2025-05-07T20:03:53.1246094Z 0x000000000000000a (STRSZ) 7890 (bytes) 2025-05-07T20:03:53.1246496Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:53.1246850Z 0x0000000000000003 (PLTGOT) 0x9d28 2025-05-07T20:03:53.1247200Z 0x0000000000000002 (PLTRELSZ) 1632 (bytes) 2025-05-07T20:03:53.1248888Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:53.1249229Z 0x0000000000000017 (JMPREL) 0x3520 2025-05-07T20:03:53.1249555Z 0x0000000000000007 (RELA) 0x3070 2025-05-07T20:03:53.1249912Z 0x0000000000000008 (RELASZ) 1200 (bytes) 2025-05-07T20:03:53.1250315Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:53.1250662Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:53.1250988Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:53.1251355Z 0x000000006ffffffe (VERNEED) 0x2fb0 2025-05-07T20:03:53.1251685Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:53.1252021Z 0x000000006ffffff0 (VERSYM) 0x2ec2 2025-05-07T20:03:53.1252362Z 0x000000006ffffff9 (RELACOUNT) 4 2025-05-07T20:03:53.1252667Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:53.1252872Z 2025-05-07T20:03:53.1253001Z ################################################################################ 2025-05-07T20:03:53.1253236Z 2025-05-07T20:03:53.1253240Z 2025-05-07T20:03:53.1253359Z ################################################################################ 2025-05-07T20:03:53.1253869Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:53.1254370Z [CHECK] Listing out library size: 2025-05-07T20:03:53.1254822Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:53.1255192Z 2025-05-07T20:03:53.1255405Z 11 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:53.1255712Z 2025-05-07T20:03:53.1256394Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:53.1257437Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.1258040Z 2025-05-07T20:03:53.1337879Z GLIBC_2.2.5 2025-05-07T20:03:53.1338529Z GLIBC_2.14 2025-05-07T20:03:53.1342752Z 2025-05-07T20:03:53.1342766Z 2025-05-07T20:03:53.1344076Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:53.1347108Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.1348945Z 2025-05-07T20:03:53.1425732Z GLIBCXX_3.4 2025-05-07T20:03:53.1426393Z GLIBCXX_3.4.9 2025-05-07T20:03:53.1427011Z GLIBCXX_3.4.11 2025-05-07T20:03:53.1427619Z GLIBCXX_3.4.20 2025-05-07T20:03:53.1428192Z GLIBCXX_3.4.21 2025-05-07T20:03:53.1428564Z 2025-05-07T20:03:53.1428611Z 2025-05-07T20:03:53.1452287Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.aR7bOZvS5w.symbols.txt 2025-05-07T20:03:53.1452778Z 2025-05-07T20:03:53.1502799Z 2025-05-07T20:03:53.1531173Z [CHECK] Total Number of symbols: 819 2025-05-07T20:03:53.1543395Z [CHECK] Number of fbgemm symbols: 73 2025-05-07T20:03:53.1559249Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so > /tmp/tmp.IoIdXIv6kA.usymbols.txt 2025-05-07T20:03:53.1560695Z 2025-05-07T20:03:53.1579414Z 2025-05-07T20:03:53.1603226Z [CHECK] Listing out undefined symbols (152 total): 2025-05-07T20:03:53.1620314Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.1621079Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:53.1621665Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:53.1622092Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:53.1622638Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:53.1623047Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:53.1623439Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:53.1623880Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:53.1624262Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:53.1624752Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:53.1625181Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:53.1625526Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:53.1625838Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:53.1626138Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:53.1626460Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:53.1626758Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:53.1627071Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:53.1627407Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:53.1627809Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:53.1628541Z U at::_ops::arange::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.1629631Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.1630919Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.1632391Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:53.1633458Z U at::_ops::full_like::call(at::Tensor const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.1634452Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:03:53.1635169Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:03:53.1636144Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.1637351Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.1638244Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:53.1638780Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:03:53.1639130Z U c10::BoolType::get() 2025-05-07T20:03:53.1639495Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:53.1641104Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:53.1641508Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.1641916Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:53.1642261Z U c10::IntType::get() 2025-05-07T20:03:53.1642653Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:53.1643129Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:53.1643533Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:53.1644204Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:53.1644830Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:53.1645225Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:53.1645588Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:53.1645946Z U c10::TensorType::get() 2025-05-07T20:03:53.1646254Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:53.1647203Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:53.1648127Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:53.1648468Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:53.1648812Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:53.1649141Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:53.1649476Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:53.1649801Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:53.1650262Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:53.1650719Z U c10::cuda::current_device() 2025-05-07T20:03:53.1651014Z U c10::cuda::device_count() 2025-05-07T20:03:53.1651348Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:53.1651707Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:53.1652086Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:53.1652471Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:53.1652857Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:53.1653232Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:53.1653936Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:53.1654788Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:53.1655622Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.1656516Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:53.1657726Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.1658548Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:53.1659059Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:53.1659441Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:53.1659932Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:53.1660356Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:53.1660747Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:53.1661143Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:53.1661523Z U c10::throwNullDataPtrError() 2025-05-07T20:03:53.1661855Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:53.1662201Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:53.1662624Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:53.1663103Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:53.1663484Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:53.1663894Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.1664291Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:53.1664667Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:53.1665037Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:53.1665421Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:53.1665783Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:53.1666154Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.1666527Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:53.1666898Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:53.1667252Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:53.1667624Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:53.1667976Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:53.1668339Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:53.1668692Z U cudaMemsetAsync@libcudart.so.11.0 2025-05-07T20:03:53.1669238Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 2025-05-07T20:03:53.1669801Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:53.1670159Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:53.1670530Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:53.1670895Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.1671279Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:53.1671677Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.1672097Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.1672497Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.1672941Z U log2f@GLIBC_2.2.5 2025-05-07T20:03:53.1673333Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:53.1673806Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.1674247Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.1674661Z U memcpy@GLIBC_2.14 2025-05-07T20:03:53.1674980Z U memset@GLIBC_2.2.5 2025-05-07T20:03:53.1675335Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:53.1675709Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:53.1676061Z U printf@GLIBC_2.2.5 2025-05-07T20:03:53.1676355Z U puts@GLIBC_2.2.5 2025-05-07T20:03:53.1676920Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:53.1677796Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:53.1678768Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.1679860Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.1680940Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:53.1681871Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.1682904Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:53.1684072Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.1684965Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.1685391Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.1685794Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:53.1686210Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:53.1686684Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:53.1687567Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.1688349Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:53.1688706Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:53.1689045Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:53.1689387Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:53.1689769Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.1690287Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.1690720Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:53.1691204Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:53.1691528Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:53.1692528Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:53.1693735Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:53.1694595Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:53.1695341Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:53.1696051Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.1696544Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:53.1696988Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:53.1697440Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:53.1698057Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.1698788Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:53.1699264Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:53.1699594Z w _ITM_registerTMCloneTable 2025-05-07T20:03:53.1699920Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:53.1700222Z w __gmon_start__ 2025-05-07T20:03:53.1700512Z w __pthread_key_create 2025-05-07T20:03:53.1700822Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:53.1701170Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:53.1701542Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:53.1702045Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:53.1702365Z 2025-05-07T20:03:53.1702527Z linux-vdso.so.1 (0x00007ffe249c6000) 2025-05-07T20:03:53.1702830Z libtorch.so => not found 2025-05-07T20:03:53.1703128Z libc10.so => not found 2025-05-07T20:03:53.1703378Z libc10_cuda.so => not found 2025-05-07T20:03:53.1703667Z libtorch_cpu.so => not found 2025-05-07T20:03:53.1703956Z libtorch_cuda.so => not found 2025-05-07T20:03:53.1704230Z libcudart.so.11.0 => not found 2025-05-07T20:03:53.1704587Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f3fe839c000) 2025-05-07T20:03:53.1705011Z libm.so.6 => /lib64/libm.so.6 (0x00007f3fe93a8000) 2025-05-07T20:03:53.1705410Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f3fe9352000) 2025-05-07T20:03:53.1705818Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f3fe9324000) 2025-05-07T20:03:53.1706217Z libc.so.6 => /lib64/libc.so.6 (0x00007f3fe8194000) 2025-05-07T20:03:53.1706578Z /lib64/ld-linux-x86-64.so.2 (0x00007f3fe9489000) 2025-05-07T20:03:53.1706830Z 2025-05-07T20:03:53.1706942Z [CHECK] Displaying ELF information: 2025-05-07T20:03:53.1707389Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so 2025-05-07T20:03:53.1707729Z 2025-05-07T20:03:53.1707735Z 2025-05-07T20:03:53.1707894Z Dynamic section at offset 0xa76868 contains 37 entries: 2025-05-07T20:03:53.1708290Z Tag Type Name/Value 2025-05-07T20:03:53.1708717Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:53.1709241Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:53.1709761Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:53.1710283Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:53.1710819Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:53.1711349Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:53.1711895Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:53.1712400Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:53.1712998Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:53.1713525Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:53.1714101Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:53.1714646Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:03:53.1715100Z 0x000000000000000c (INIT) 0x2e000 2025-05-07T20:03:53.1715445Z 0x000000000000000d (FINI) 0xc47fc 2025-05-07T20:03:53.1715782Z 0x0000000000000019 (INIT_ARRAY) 0xa75ea0 2025-05-07T20:03:53.1716154Z 0x000000000000001b (INIT_ARRAYSZ) 208 (bytes) 2025-05-07T20:03:53.1716519Z 0x000000000000001a (FINI_ARRAY) 0xa75f70 2025-05-07T20:03:53.1716866Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:53.1717225Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:53.1717557Z 0x0000000000000005 (STRTAB) 0x6b50 2025-05-07T20:03:53.1717902Z 0x0000000000000006 (SYMTAB) 0x1e70 2025-05-07T20:03:53.1718300Z 0x000000000000000a (STRSZ) 120164 (bytes) 2025-05-07T20:03:53.1718689Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:53.1719038Z 0x0000000000000003 (PLTGOT) 0xa77b08 2025-05-07T20:03:53.1719414Z 0x0000000000000002 (PLTRELSZ) 10416 (bytes) 2025-05-07T20:03:53.1719776Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:53.1720103Z 0x0000000000000017 (JMPREL) 0x2aa30 2025-05-07T20:03:53.1720475Z 0x0000000000000007 (RELA) 0x24820 2025-05-07T20:03:53.1720845Z 0x0000000000000008 (RELASZ) 25104 (bytes) 2025-05-07T20:03:53.1721246Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:53.1721623Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:53.1721979Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:53.1722361Z 0x000000006ffffffe (VERNEED) 0x24720 2025-05-07T20:03:53.1722741Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:53.1723117Z 0x000000006ffffff0 (VERSYM) 0x240b4 2025-05-07T20:03:53.1723472Z 0x000000006ffffff9 (RELACOUNT) 176 2025-05-07T20:03:53.1723831Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:53.1724048Z 2025-05-07T20:03:53.1724204Z ################################################################################ 2025-05-07T20:03:53.1724463Z 2025-05-07T20:03:53.1724468Z 2025-05-07T20:03:53.1724586Z ################################################################################ 2025-05-07T20:03:53.1725247Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:53.1725740Z [CHECK] Listing out library size: 2025-05-07T20:03:53.1726223Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:53.1726593Z 2025-05-07T20:03:53.1726809Z 5 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:53.1727144Z 2025-05-07T20:03:53.1727533Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:53.1728509Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.1729077Z 2025-05-07T20:03:53.1765474Z GLIBC_2.2.5 2025-05-07T20:03:53.1766136Z GLIBC_2.3 2025-05-07T20:03:53.1766685Z GLIBC_2.14 2025-05-07T20:03:53.1767067Z 2025-05-07T20:03:53.1767081Z 2025-05-07T20:03:53.1768351Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:53.1771558Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.1784726Z 2025-05-07T20:03:53.1828473Z GLIBCXX_3.4 2025-05-07T20:03:53.1829135Z GLIBCXX_3.4.9 2025-05-07T20:03:53.1829773Z GLIBCXX_3.4.11 2025-05-07T20:03:53.1830358Z GLIBCXX_3.4.18 2025-05-07T20:03:53.1830973Z GLIBCXX_3.4.21 2025-05-07T20:03:53.1831761Z 2025-05-07T20:03:53.1831775Z 2025-05-07T20:03:53.1850072Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.ADp3By9F7A.symbols.txt 2025-05-07T20:03:53.1851609Z 2025-05-07T20:03:53.1874650Z 2025-05-07T20:03:53.1900515Z [CHECK] Total Number of symbols: 338 2025-05-07T20:03:53.1913618Z [CHECK] Number of fbgemm symbols: 16 2025-05-07T20:03:53.1930148Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so > /tmp/tmp.OcgTMz0rbf.usymbols.txt 2025-05-07T20:03:53.1932081Z 2025-05-07T20:03:53.1948906Z 2025-05-07T20:03:53.1973892Z [CHECK] Listing out undefined symbols (128 total): 2025-05-07T20:03:53.1989021Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.1991880Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.1992458Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:53.1992931Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:53.1993362Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:53.1993761Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:53.1994170Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:53.1994571Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:53.1994933Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:53.1995388Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:53.1995741Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:53.1996067Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:53.1996433Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:53.1996757Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:53.1997086Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:53.1997417Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:53.1997749Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:53.1998121Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:53.1998542Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:53.1999118Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:53.1999588Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:53.1999974Z U c10::BoolType::get() 2025-05-07T20:03:53.2000328Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:53.2000707Z U c10::FloatType::get() 2025-05-07T20:03:53.2001021Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:53.2001429Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.2001854Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:53.2002213Z U c10::IntType::get() 2025-05-07T20:03:53.2002585Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:53.2003157Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:53.2003548Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:53.2003954Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:53.2004648Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:53.2005323Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:53.2005691Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:53.2006029Z U c10::TensorType::get() 2025-05-07T20:03:53.2006360Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:53.2007337Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:53.2008338Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:53.2008704Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:53.2009070Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:53.2009422Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:53.2009780Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:53.2010144Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:53.2010627Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:53.2011153Z U c10::cuda::device_count() 2025-05-07T20:03:53.2011504Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:53.2011902Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:53.2012301Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:53.2012717Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:53.2013147Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:53.2013540Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:53.2014337Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:53.2015233Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:53.2016158Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.2017163Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:53.2018221Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.2019296Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:53.2019652Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:53.2020000Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:53.2020389Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:53.2020788Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:53.2021166Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:53.2021704Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:53.2022142Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.2022533Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:53.2022894Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:53.2023258Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:53.2023605Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:53.2023954Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:53.2024314Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.2024683Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:53.2025063Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:53.2025405Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:53.2025759Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:53.2026097Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:53.2026462Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.2026836Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:53.2027182Z U float at::Tensor::item() const 2025-05-07T20:03:53.2027573Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.2028283Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.2028700Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.2029234Z U memcpy@GLIBC_2.14 2025-05-07T20:03:53.2029539Z U memset@GLIBC_2.2.5 2025-05-07T20:03:53.2029862Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:53.2030241Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:53.2030890Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:53.2031969Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:53.2032973Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.2034061Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.2035156Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:53.2036131Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.2037199Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.2038331Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:53.2039185Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:53.2039815Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:53.2040162Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:53.2040548Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.2040952Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.2041360Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:53.2041863Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:53.2042816Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.2043645Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:53.2044017Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:53.2044367Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:53.2044836Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:53.2045336Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.2045855Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.2046309Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:53.2046650Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:53.2046952Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:53.2047248Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:53.2048036Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:53.2049144Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:53.2049924Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:53.2050624Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:53.2051232Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:53.2051642Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:53.2052058Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:53.2052622Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.2053265Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:53.2053692Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:53.2054030Z w _ITM_registerTMCloneTable 2025-05-07T20:03:53.2054335Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:53.2054618Z w __gmon_start__ 2025-05-07T20:03:53.2054887Z w __pthread_key_create 2025-05-07T20:03:53.2055213Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:53.2055539Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:53.2055887Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:53.2056341Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:53.2056654Z 2025-05-07T20:03:53.2056818Z linux-vdso.so.1 (0x00007ffddeb7f000) 2025-05-07T20:03:53.2057096Z libtorch.so => not found 2025-05-07T20:03:53.2057343Z libc10.so => not found 2025-05-07T20:03:53.2057570Z libc10_cuda.so => not found 2025-05-07T20:03:53.2057829Z libtorch_cpu.so => not found 2025-05-07T20:03:53.2058080Z libtorch_cuda.so => not found 2025-05-07T20:03:53.2058349Z libcudart.so.11.0 => not found 2025-05-07T20:03:53.2058668Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f30b919c000) 2025-05-07T20:03:53.2059077Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f30b9a22000) 2025-05-07T20:03:53.2059470Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f30b99f4000) 2025-05-07T20:03:53.2059826Z libc.so.6 => /lib64/libc.so.6 (0x00007f30b8f94000) 2025-05-07T20:03:53.2060181Z /lib64/ld-linux-x86-64.so.2 (0x00007f30b9a7e000) 2025-05-07T20:03:53.2060515Z libm.so.6 => /lib64/libm.so.6 (0x00007f30b9919000) 2025-05-07T20:03:53.2060747Z 2025-05-07T20:03:53.2060852Z [CHECK] Displaying ELF information: 2025-05-07T20:03:53.2061445Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so 2025-05-07T20:03:53.2061812Z 2025-05-07T20:03:53.2064403Z 2025-05-07T20:03:53.2064718Z Dynamic section at offset 0x467450 contains 37 entries: 2025-05-07T20:03:53.2065128Z Tag Type Name/Value 2025-05-07T20:03:53.2065579Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:53.2066351Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:53.2066853Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:53.2067380Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:53.2067905Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:53.2068490Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:53.2069019Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:53.2069517Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:53.2070017Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:53.2070503Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:53.2071018Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:53.2071572Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_optimizers.so] 2025-05-07T20:03:53.2072036Z 0x000000000000000c (INIT) 0xf000 2025-05-07T20:03:53.2072363Z 0x000000000000000d (FINI) 0x31c4c 2025-05-07T20:03:53.2072758Z 0x0000000000000019 (INIT_ARRAY) 0x467fe0 2025-05-07T20:03:53.2073214Z 0x000000000000001b (INIT_ARRAYSZ) 48 (bytes) 2025-05-07T20:03:53.2073554Z 0x000000000000001a (FINI_ARRAY) 0x468010 2025-05-07T20:03:53.2073895Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:53.2074249Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:53.2074574Z 0x0000000000000005 (STRTAB) 0x2cc8 2025-05-07T20:03:53.2074898Z 0x0000000000000006 (SYMTAB) 0xd00 2025-05-07T20:03:53.2075228Z 0x000000000000000a (STRSZ) 38026 (bytes) 2025-05-07T20:03:53.2075586Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:53.2075956Z 0x0000000000000003 (PLTGOT) 0x4686f0 2025-05-07T20:03:53.2076321Z 0x0000000000000002 (PLTRELSZ) 4752 (bytes) 2025-05-07T20:03:53.2076655Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:53.2077008Z 0x0000000000000017 (JMPREL) 0xdab0 2025-05-07T20:03:53.2077326Z 0x0000000000000007 (RELA) 0xc508 2025-05-07T20:03:53.2077672Z 0x0000000000000008 (RELASZ) 5544 (bytes) 2025-05-07T20:03:53.2078023Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:53.2078337Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:53.2078692Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:53.2079032Z 0x000000006ffffffe (VERNEED) 0xc3f8 2025-05-07T20:03:53.2079356Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:53.2079666Z 0x000000006ffffff0 (VERSYM) 0xc152 2025-05-07T20:03:53.2079981Z 0x000000006ffffff9 (RELACOUNT) 58 2025-05-07T20:03:53.2080279Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:53.2080487Z 2025-05-07T20:03:53.2080599Z ################################################################################ 2025-05-07T20:03:53.2080822Z 2025-05-07T20:03:53.2080828Z 2025-05-07T20:03:53.2080947Z ################################################################################ 2025-05-07T20:03:53.2081377Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:03:53.2081795Z [CHECK] Listing out library size: 2025-05-07T20:03:53.2082184Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:03:53.2082496Z 2025-05-07T20:03:53.2082643Z 6 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:03:53.2082880Z 2025-05-07T20:03:53.2083214Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:03:53.2084063Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.2084604Z 2025-05-07T20:03:53.2349866Z GLIBC_2.2.5 2025-05-07T20:03:53.2350194Z GLIBC_2.3 2025-05-07T20:03:53.2350387Z GLIBC_2.14 2025-05-07T20:03:53.2350496Z 2025-05-07T20:03:53.2350512Z 2025-05-07T20:03:53.2350877Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:03:53.2351780Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.2352344Z 2025-05-07T20:03:53.2622208Z GLIBCXX_3.4 2025-05-07T20:03:53.2622900Z GLIBCXX_3.4.9 2025-05-07T20:03:53.2623477Z GLIBCXX_3.4.11 2025-05-07T20:03:53.2624056Z GLIBCXX_3.4.14 2025-05-07T20:03:53.2624605Z GLIBCXX_3.4.15 2025-05-07T20:03:53.2625166Z GLIBCXX_3.4.18 2025-05-07T20:03:53.2625712Z GLIBCXX_3.4.21 2025-05-07T20:03:53.2626068Z 2025-05-07T20:03:53.2626082Z 2025-05-07T20:03:53.2641249Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so > /tmp/tmp.WSMCKddWaI.symbols.txt 2025-05-07T20:03:53.2642305Z 2025-05-07T20:03:53.2866247Z 2025-05-07T20:03:53.2891716Z [CHECK] Total Number of symbols: 4957 2025-05-07T20:03:53.2909054Z [CHECK] Number of fbgemm symbols: 3554 2025-05-07T20:03:53.2926530Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so > /tmp/tmp.afcgh3zU0r.usymbols.txt 2025-05-07T20:03:53.2928158Z 2025-05-07T20:03:53.2953811Z 2025-05-07T20:03:53.2980433Z [CHECK] Listing out undefined symbols (135 total): 2025-05-07T20:03:53.2994643Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:53.2995741Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:53.2996723Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:53.2997620Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:53.2998528Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:53.2999451Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:53.3000692Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:53.3001589Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:53.3002521Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:53.3003520Z U __cxa_init_primary_exception@CXXABI_1.3.11 2025-05-07T20:03:53.3004622Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:53.3005164Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:53.3005467Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:53.3005772Z U __extendhfsf2@GCC_12.0.0 2025-05-07T20:03:53.3006119Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:53.3006559Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:03:53.3006847Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:53.3007140Z U __truncsfhf2@GCC_12.0.0 2025-05-07T20:03:53.3007426Z U abort@GLIBC_2.2.5 2025-05-07T20:03:53.3007882Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:53.3008632Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:53.3009677Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:53.3010791Z U asmjit::_abi_1_13::BaseEmitter::_emitI(unsigned int, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&, asmjit::_abi_1_13::Operand_ const&) 2025-05-07T20:03:53.3011876Z U asmjit::_abi_1_13::BaseEmitter::emitArgsAssignment(asmjit::_abi_1_13::FuncFrame const&, asmjit::_abi_1_13::FuncArgsAssignment const&) 2025-05-07T20:03:53.3012632Z U asmjit::_abi_1_13::BaseEmitter::emitEpilog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:03:53.3013177Z U asmjit::_abi_1_13::BaseEmitter::emitProlog(asmjit::_abi_1_13::FuncFrame const&) 2025-05-07T20:03:53.3013754Z U asmjit::_abi_1_13::CodeHolder::CodeHolder(asmjit::_abi_1_13::Support::Temporary const*) 2025-05-07T20:03:53.3014350Z U asmjit::_abi_1_13::CodeHolder::init(asmjit::_abi_1_13::Environment const&, unsigned long) 2025-05-07T20:03:53.3014824Z U asmjit::_abi_1_13::CodeHolder::~CodeHolder() 2025-05-07T20:03:53.3015347Z U asmjit::_abi_1_13::FuncArgsAssignment::updateFuncFrame(asmjit::_abi_1_13::FuncFrame&) const 2025-05-07T20:03:53.3016054Z U asmjit::_abi_1_13::FuncDetail::init(asmjit::_abi_1_13::FuncSignature const&, asmjit::_abi_1_13::Environment const&) 2025-05-07T20:03:53.3016609Z U asmjit::_abi_1_13::FuncFrame::finalize() 2025-05-07T20:03:53.3017012Z U asmjit::_abi_1_13::FuncFrame::init(asmjit::_abi_1_13::FuncDetail const&) 2025-05-07T20:03:53.3017590Z U asmjit::_abi_1_13::JitRuntime::JitRuntime(asmjit::_abi_1_13::JitAllocator::CreateParams const*) 2025-05-07T20:03:53.3018090Z U asmjit::_abi_1_13::JitRuntime::~JitRuntime() 2025-05-07T20:03:53.3018505Z U asmjit::_abi_1_13::x86::Assembler::Assembler(asmjit::_abi_1_13::CodeHolder*) 2025-05-07T20:03:53.3018969Z U asmjit::_abi_1_13::x86::Assembler::~Assembler() 2025-05-07T20:03:53.3019280Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:53.3019541Z U ceilf@GLIBC_2.2.5 2025-05-07T20:03:53.3019841Z U cpuinfo_get_packages 2025-05-07T20:03:53.3020121Z U cpuinfo_get_packages_count 2025-05-07T20:03:53.3020405Z U cpuinfo_initialize 2025-05-07T20:03:53.3020673Z U cpuinfo_isa 2025-05-07T20:03:53.3020906Z U floor@GLIBC_2.2.5 2025-05-07T20:03:53.3021164Z U fma@GLIBC_2.2.5 2025-05-07T20:03:53.3021414Z U fmaf@GLIBC_2.2.5 2025-05-07T20:03:53.3021713Z U free@GLIBC_2.2.5 2025-05-07T20:03:53.3021972Z U fwrite@GLIBC_2.2.5 2025-05-07T20:03:53.3022253Z U getenv@GLIBC_2.2.5 2025-05-07T20:03:53.3022518Z U ldexp@GLIBC_2.2.5 2025-05-07T20:03:53.3022826Z U log2@GLIBC_2.2.5 2025-05-07T20:03:53.3023087Z U log2f@GLIBC_2.2.5 2025-05-07T20:03:53.3023371Z U lrintf@GLIBC_2.2.5 2025-05-07T20:03:53.3023680Z U memcpy@GLIBC_2.14 2025-05-07T20:03:53.3023961Z U memset@GLIBC_2.2.5 2025-05-07T20:03:53.3024294Z U nearbyint@GLIBC_2.2.5 2025-05-07T20:03:53.3024595Z U nearbyintf@GLIBC_2.2.5 2025-05-07T20:03:53.3024941Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:53.3025279Z U operator delete[](void*)@GLIBCXX_3.4 2025-05-07T20:03:53.3025648Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:53.3026007Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:53.3026368Z U posix_memalign@GLIBC_2.2.5 2025-05-07T20:03:53.3026696Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:03:53.3027091Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:53.3027607Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:53.3028054Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:53.3028724Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:03:53.3029460Z U std::__atomic_futex_unsigned_base::_M_futex_notify_all(unsigned int*)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3030436Z U std::__atomic_futex_unsigned_base::_M_futex_wait_until(unsigned int*, unsigned int, bool, std::chrono::duration >, std::chrono::duration >)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3031985Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3033170Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3034217Z U std::__cxx11::basic_string, std::allocator >::compare(char const*) const@GLIBCXX_3.4.21 2025-05-07T20:03:53.3035094Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:53.3035884Z U std::__detail::_Prime_rehash_policy::_M_next_bkt(unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:53.3036401Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:03:53.3036821Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:03:53.3037310Z U std::__exception_ptr::exception_ptr::exception_ptr(void*)@CXXABI_1.3.11 2025-05-07T20:03:53.3037826Z U std::__future_base::_Result_base::_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:03:53.3038399Z U std::__future_base::_Result_base::~_Result_base()@GLIBCXX_3.4.15 2025-05-07T20:03:53.3038810Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:03:53.3039168Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:03:53.3039529Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:53.3039867Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:53.3040217Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:03:53.3040581Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:53.3041019Z U std::__throw_future_error(int)@GLIBCXX_3.4.14 2025-05-07T20:03:53.3041410Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.3041815Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:53.3042241Z U std::bad_alloc::~bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:53.3043073Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.3043982Z U std::cerr@GLIBCXX_3.4 2025-05-07T20:03:53.3044290Z U std::cout@GLIBCXX_3.4 2025-05-07T20:03:53.3044670Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:03:53.3045089Z U std::future_category()@GLIBCXX_3.4.15 2025-05-07T20:03:53.3045462Z U std::future_error::~future_error()@GLIBCXX_3.4.14 2025-05-07T20:03:53.3045860Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:53.3046212Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:53.3046882Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3047637Z U std::logic_error::logic_error(std::logic_error const&)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3048162Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:03:53.3048681Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.3049228Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.3049717Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:03:53.3050303Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:53.3050638Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:03:53.3051074Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:03:53.3051567Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3051993Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:53.3052349Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:53.3052646Z U stderr@GLIBC_2.2.5 2025-05-07T20:03:53.3052935Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:53.3053203Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:53.3053481Z U strstr@GLIBC_2.2.5 2025-05-07T20:03:53.3053750Z U tolower@GLIBC_2.2.5 2025-05-07T20:03:53.3054039Z U toupper@GLIBC_2.2.5 2025-05-07T20:03:53.3054391Z U typeinfo for std::__future_base::_Result_base@GLIBCXX_3.4.15 2025-05-07T20:03:53.3054801Z U typeinfo for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:03:53.3055171Z U typeinfo for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:03:53.3055527Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:53.3055915Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:53.3056340Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:53.3056725Z U vtable for std::bad_alloc@GLIBCXX_3.4 2025-05-07T20:03:53.3057079Z U vtable for std::future_error@GLIBCXX_3.4.14 2025-05-07T20:03:53.3057414Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:53.3057738Z w _ITM_registerTMCloneTable 2025-05-07T20:03:53.3058033Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:53.3058332Z w __gmon_start__ 2025-05-07T20:03:53.3058592Z w __pthread_key_create 2025-05-07T20:03:53.3058873Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:53.3059218Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:53.3059508Z w pthread_once 2025-05-07T20:03:53.3059766Z w pthread_rwlock_rdlock 2025-05-07T20:03:53.3060064Z w pthread_rwlock_unlock 2025-05-07T20:03:53.3060351Z w pthread_rwlock_wrlock 2025-05-07T20:03:53.3060632Z w pthread_self@GLIBC_2.2.5 2025-05-07T20:03:53.3060971Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:53.3061356Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:03:53.3061620Z 2025-05-07T20:03:53.3061748Z linux-vdso.so.1 (0x00007ffd3cbdd000) 2025-05-07T20:03:53.3062025Z libc10.so => not found 2025-05-07T20:03:53.3062505Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007fc0e0a4e000) 2025-05-07T20:03:53.3063050Z libtorch.so => not found 2025-05-07T20:03:53.3063299Z libtorch_cpu.so => not found 2025-05-07T20:03:53.3063541Z libtorch_cuda.so => not found 2025-05-07T20:03:53.3063864Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fc0e019c000) 2025-05-07T20:03:53.3064225Z libm.so.6 => /lib64/libm.so.6 (0x00007fc0e0971000) 2025-05-07T20:03:53.3064597Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fc0e016e000) 2025-05-07T20:03:53.3064956Z libc.so.6 => /lib64/libc.so.6 (0x00007fc0dff66000) 2025-05-07T20:03:53.3065302Z /lib64/ld-linux-x86-64.so.2 (0x00007fc0e0aca000) 2025-05-07T20:03:53.3065602Z libtorch_cpu.so => not found 2025-05-07T20:03:53.3065865Z libtorch_cuda.so => not found 2025-05-07T20:03:53.3066112Z libtorch.so => not found 2025-05-07T20:03:53.3066400Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fc0dff10000) 2025-05-07T20:03:53.3066768Z librt.so.1 => /lib64/librt.so.1 (0x00007fc0e096a000) 2025-05-07T20:03:53.3067150Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fc0e0965000) 2025-05-07T20:03:53.3067417Z 2025-05-07T20:03:53.3067530Z [CHECK] Displaying ELF information: 2025-05-07T20:03:53.3067872Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so 2025-05-07T20:03:53.3068132Z 2025-05-07T20:03:53.3074188Z 2025-05-07T20:03:53.3074722Z Dynamic section at offset 0x54e508 contains 37 entries: 2025-05-07T20:03:53.3075887Z Tag Type Name/Value 2025-05-07T20:03:53.3077078Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:53.3078547Z 0x0000000000000001 (NEEDED) Shared library: [asmjit.so] 2025-05-07T20:03:53.3079884Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:53.3080400Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:53.3080927Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:53.3081438Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:53.3081951Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:53.3082445Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:53.3082951Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:53.3083470Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:53.3083989Z 0x000000000000000e (SONAME) Library soname: [fbgemm.so] 2025-05-07T20:03:53.3084525Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:53.3084920Z 0x000000000000000c (INIT) 0xfd000 2025-05-07T20:03:53.3085258Z 0x000000000000000d (FINI) 0x4c1d18 2025-05-07T20:03:53.3085583Z 0x0000000000000019 (INIT_ARRAY) 0x54b000 2025-05-07T20:03:53.3085943Z 0x000000000000001b (INIT_ARRAYSZ) 1224 (bytes) 2025-05-07T20:03:53.3086296Z 0x000000000000001a (FINI_ARRAY) 0x54b4c8 2025-05-07T20:03:53.3086647Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:53.3086997Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:53.3087347Z 0x0000000000000005 (STRTAB) 0x24e38 2025-05-07T20:03:53.3087683Z 0x0000000000000006 (SYMTAB) 0x7d68 2025-05-07T20:03:53.3088028Z 0x000000000000000a (STRSZ) 754916 (bytes) 2025-05-07T20:03:53.3088427Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:53.3088766Z 0x0000000000000003 (PLTGOT) 0x54e798 2025-05-07T20:03:53.3089124Z 0x0000000000000002 (PLTRELSZ) 26136 (bytes) 2025-05-07T20:03:53.3089483Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:53.3089829Z 0x0000000000000017 (JMPREL) 0xf6768 2025-05-07T20:03:53.3090160Z 0x0000000000000007 (RELA) 0xdfb48 2025-05-07T20:03:53.3090502Z 0x0000000000000008 (RELASZ) 93216 (bytes) 2025-05-07T20:03:53.3090860Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:53.3091177Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:53.3091508Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:53.3091857Z 0x000000006ffffffe (VERNEED) 0xdf9d8 2025-05-07T20:03:53.3092193Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:53.3092521Z 0x000000006ffffff0 (VERSYM) 0xdd31c 2025-05-07T20:03:53.3092847Z 0x000000006ffffff9 (RELACOUNT) 155 2025-05-07T20:03:53.3093154Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:53.3093356Z 2025-05-07T20:03:53.3093470Z ################################################################################ 2025-05-07T20:03:53.3093710Z 2025-05-07T20:03:53.3093714Z 2025-05-07T20:03:53.3093825Z ################################################################################ 2025-05-07T20:03:53.3094316Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:53.3094809Z [CHECK] Listing out library size: 2025-05-07T20:03:53.3095263Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:53.3095631Z 2025-05-07T20:03:53.3095833Z 2 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:53.3096144Z 2025-05-07T20:03:53.3096527Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:53.3097511Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.3098098Z 2025-05-07T20:03:53.3149227Z GLIBC_2.2.5 2025-05-07T20:03:53.3149882Z GLIBC_2.14 2025-05-07T20:03:53.3150209Z 2025-05-07T20:03:53.3150249Z 2025-05-07T20:03:53.3151451Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:53.3154241Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.3154861Z 2025-05-07T20:03:53.3212092Z GLIBCXX_3.4 2025-05-07T20:03:53.3212553Z GLIBCXX_3.4.9 2025-05-07T20:03:53.3212784Z GLIBCXX_3.4.14 2025-05-07T20:03:53.3212984Z GLIBCXX_3.4.20 2025-05-07T20:03:53.3213187Z GLIBCXX_3.4.21 2025-05-07T20:03:53.3213306Z 2025-05-07T20:03:53.3213324Z 2025-05-07T20:03:53.3230636Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.mhY1ZPbLJr.symbols.txt 2025-05-07T20:03:53.3231151Z 2025-05-07T20:03:53.3260458Z 2025-05-07T20:03:53.3284814Z [CHECK] Total Number of symbols: 540 2025-05-07T20:03:53.3300943Z [CHECK] Number of fbgemm symbols: 48 2025-05-07T20:03:53.3316390Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so > /tmp/tmp.xfOzgyQCTM.usymbols.txt 2025-05-07T20:03:53.3317867Z 2025-05-07T20:03:53.3333698Z 2025-05-07T20:03:53.3357636Z [CHECK] Listing out undefined symbols (183 total): 2025-05-07T20:03:53.3375118Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.3377297Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:53.3378330Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:53.3379015Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:53.3379570Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:53.3379961Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:53.3380435Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:53.3380780Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:53.3381181Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:53.3381516Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:53.3381827Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:53.3382111Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:53.3382410Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:53.3382700Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:53.3383009Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:53.3383313Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:53.3383611Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:53.3383909Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:53.3384182Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:53.3384478Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:53.3384947Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:03:53.3385494Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:53.3385935Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:53.3386796Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.3387659Z U at::_ops::is_nonzero::call(at::Tensor const&) 2025-05-07T20:03:53.3388067Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:53.3388523Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:53.3389143Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:03:53.3390182Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:53.3390997Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:53.3391748Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.3392511Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:53.3392959Z U at::get_num_threads() 2025-05-07T20:03:53.3393487Z U at::get_thread_num() 2025-05-07T20:03:53.3393807Z U at::internal::set_thread_num(int) 2025-05-07T20:03:53.3394171Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:03:53.3394508Z U c10::BoolType::get() 2025-05-07T20:03:53.3394870Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:53.3395521Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:53.3396119Z U c10::Error::what() const 2025-05-07T20:03:53.3396511Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.3396943Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.3397382Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:53.3397759Z U c10::IntType::get() 2025-05-07T20:03:53.3398131Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:53.3398542Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:53.3399015Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:53.3399579Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:53.3399909Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:53.3400260Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:53.3400628Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:53.3401249Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:53.3401864Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:53.3402220Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:53.3402565Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:53.3402896Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:03:53.3403228Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:53.3403533Z U c10::SymIntType::get() 2025-05-07T20:03:53.3403857Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:53.3404199Z U c10::TensorType::get() 2025-05-07T20:03:53.3404495Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:53.3405400Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:53.3406313Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:53.3406698Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:03:53.3407217Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:03:53.3407892Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:03:53.3408421Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:53.3408749Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:53.3409063Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:53.3409386Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:53.3409697Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:53.3410140Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:53.3410584Z U c10::cuda::device_count() 2025-05-07T20:03:53.3410891Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:53.3411292Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:53.3411649Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:53.3412018Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:53.3412392Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:53.3412755Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:53.3413454Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:53.3414355Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:53.3415174Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.3416304Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:53.3417355Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.3418343Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:53.3418688Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:53.3419024Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:53.3419391Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:53.3419734Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:03:53.3420113Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:53.3420502Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:53.3420902Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:53.3421288Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:03:53.3421638Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:03:53.3421987Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:53.3422393Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:53.3422850Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.3423229Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:53.3423594Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:53.3423949Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:53.3424295Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:53.3424640Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:53.3424989Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.3425386Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:53.3425742Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:53.3426216Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:53.3426581Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:53.3426935Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.3427310Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:53.3428311Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.3429971Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.3432078Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.3433882Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.3435635Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.3437517Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.3439412Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.3441250Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.3443077Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.3445024Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.3446743Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.3448474Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.3449576Z U fbgemm::fbgemmAlignedAlloc(unsigned long, unsigned long, bool) 2025-05-07T20:03:53.3450167Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:03:53.3450740Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:53.3451248Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.3451674Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.3452264Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.3452659Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.3453098Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:53.3453527Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.3453927Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.3454283Z U memcpy@GLIBC_2.14 2025-05-07T20:03:53.3454582Z U memset@GLIBC_2.2.5 2025-05-07T20:03:53.3454874Z U omp_get_max_threads@OMP_1.0 2025-05-07T20:03:53.3455201Z U omp_get_thread_num@OMP_1.0 2025-05-07T20:03:53.3455571Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:53.3455922Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:53.3456524Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:53.3457386Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:53.3458314Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3459631Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.3460612Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3461506Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3462455Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:53.3463468Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3464202Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:53.3464578Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.3465122Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.3465556Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:53.3466079Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:53.3467214Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.3468051Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:53.3468413Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:53.3468777Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:53.3469146Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:53.3469487Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:53.3469908Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.3470453Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.3470952Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:53.3471495Z U std::pair fbgemm::radix_sort_parallel(int*, int*, int*, int*, long, long, bool) 2025-05-07T20:03:53.3472454Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:03:53.3473697Z U std::pair*> fbgemm::radix_sort_parallel >(int*, std::pair*, int*, std::pair*, long, long, bool) 2025-05-07T20:03:53.3474447Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:53.3474769Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:53.3475106Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:53.3475980Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:53.3477186Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:53.3478042Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:53.3478786Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:53.3479419Z U typeinfo for c10::Error 2025-05-07T20:03:53.3479755Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:53.3480200Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.3480863Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:53.3481294Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:53.3481744Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:53.3482115Z U vtable for c10::Error 2025-05-07T20:03:53.3482702Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.3483394Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:53.3483853Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:53.3484194Z w _ITM_registerTMCloneTable 2025-05-07T20:03:53.3484504Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:53.3484815Z w __gmon_start__ 2025-05-07T20:03:53.3485088Z w __pthread_key_create 2025-05-07T20:03:53.3485563Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:53.3486002Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:53.3486299Z 2025-05-07T20:03:53.3486438Z linux-vdso.so.1 (0x00007ffdc4f86000) 2025-05-07T20:03:53.3486725Z libc10.so => not found 2025-05-07T20:03:53.3486971Z libc10_cuda.so => not found 2025-05-07T20:03:53.3487482Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f505b800000) 2025-05-07T20:03:53.3488356Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f505bf4e000) 2025-05-07T20:03:53.3488968Z libtorch.so => not found 2025-05-07T20:03:53.3489224Z libtorch_cpu.so => not found 2025-05-07T20:03:53.3489481Z libtorch_cuda.so => not found 2025-05-07T20:03:53.3489741Z libcudart.so.11.0 => not found 2025-05-07T20:03:53.3490053Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f505b59c000) 2025-05-07T20:03:53.3490443Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f505bef6000) 2025-05-07T20:03:53.3490829Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f505bec8000) 2025-05-07T20:03:53.3491178Z libc.so.6 => /lib64/libc.so.6 (0x00007f505b394000) 2025-05-07T20:03:53.3491481Z libc10.so => not found 2025-05-07T20:03:53.3491957Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f505be4e000) 2025-05-07T20:03:53.3492488Z libtorch.so => not found 2025-05-07T20:03:53.3492724Z libtorch_cpu.so => not found 2025-05-07T20:03:53.3492982Z libtorch_cuda.so => not found 2025-05-07T20:03:53.3493273Z libm.so.6 => /lib64/libm.so.6 (0x00007f505bd71000) 2025-05-07T20:03:53.3493613Z /lib64/ld-linux-x86-64.so.2 (0x00007f505c153000) 2025-05-07T20:03:53.3493935Z libtorch.so => not found 2025-05-07T20:03:53.3494165Z libc10.so => not found 2025-05-07T20:03:53.3494396Z libtorch_cpu.so => not found 2025-05-07T20:03:53.3494650Z libtorch_cuda.so => not found 2025-05-07T20:03:53.3494987Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f505bd6a000) 2025-05-07T20:03:53.3495346Z libtorch_cpu.so => not found 2025-05-07T20:03:53.3495653Z libtorch_cuda.so => not found 2025-05-07T20:03:53.3495898Z libtorch.so => not found 2025-05-07T20:03:53.3496179Z librt.so.1 => /lib64/librt.so.1 (0x00007f505bd65000) 2025-05-07T20:03:53.3496406Z 2025-05-07T20:03:53.3496520Z [CHECK] Displaying ELF information: 2025-05-07T20:03:53.3496924Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so 2025-05-07T20:03:53.3497252Z 2025-05-07T20:03:53.3497256Z 2025-05-07T20:03:53.3497404Z Dynamic section at offset 0x189ef8 contains 39 entries: 2025-05-07T20:03:53.3497759Z Tag Type Name/Value 2025-05-07T20:03:53.3498186Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:53.3498674Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:53.3499149Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:53.3499678Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:03:53.3500174Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:53.3500661Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:53.3501172Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:53.3501674Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:53.3502173Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:53.3502647Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:53.3503131Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:53.3503593Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:53.3504088Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:53.3504574Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:53.3504963Z 0x000000000000000c (INIT) 0x16000 2025-05-07T20:03:53.3505281Z 0x000000000000000d (FINI) 0x60bac 2025-05-07T20:03:53.3505595Z 0x0000000000000019 (INIT_ARRAY) 0x189258 2025-05-07T20:03:53.3505931Z 0x000000000000001b (INIT_ARRAYSZ) 72 (bytes) 2025-05-07T20:03:53.3506258Z 0x000000000000001a (FINI_ARRAY) 0x1892a0 2025-05-07T20:03:53.3506590Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:53.3506910Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:53.3507229Z 0x0000000000000005 (STRTAB) 0x4598 2025-05-07T20:03:53.3507532Z 0x0000000000000006 (SYMTAB) 0x12e0 2025-05-07T20:03:53.3507867Z 0x000000000000000a (STRSZ) 47880 (bytes) 2025-05-07T20:03:53.3508210Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:53.3508534Z 0x0000000000000003 (PLTGOT) 0x18a1a8 2025-05-07T20:03:53.3508877Z 0x0000000000000002 (PLTRELSZ) 9240 (bytes) 2025-05-07T20:03:53.3509199Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:53.3509515Z 0x0000000000000017 (JMPREL) 0x131f0 2025-05-07T20:03:53.3509820Z 0x0000000000000007 (RELA) 0x105e0 2025-05-07T20:03:53.3510157Z 0x0000000000000008 (RELASZ) 11280 (bytes) 2025-05-07T20:03:53.3510501Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:53.3510802Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:53.3511118Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:53.3511433Z 0x000000006ffffffe (VERNEED) 0x104e0 2025-05-07T20:03:53.3511736Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:53.3512183Z 0x000000006ffffff0 (VERSYM) 0x100a0 2025-05-07T20:03:53.3512494Z 0x000000006ffffff9 (RELACOUNT) 245 2025-05-07T20:03:53.3512864Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:53.3513240Z 2025-05-07T20:03:53.3513399Z ################################################################################ 2025-05-07T20:03:53.3513632Z 2025-05-07T20:03:53.3513636Z 2025-05-07T20:03:53.3513764Z ################################################################################ 2025-05-07T20:03:53.3514306Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:53.3514790Z [CHECK] Listing out library size: 2025-05-07T20:03:53.3515231Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:53.3515603Z 2025-05-07T20:03:53.3515793Z 8 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:53.3516119Z 2025-05-07T20:03:53.3516512Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:53.3517476Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.3518103Z 2025-05-07T20:03:53.3523899Z GLIBC_2.2.5 2025-05-07T20:03:53.3524474Z GLIBC_2.14 2025-05-07T20:03:53.3525247Z 2025-05-07T20:03:53.3525370Z 2025-05-07T20:03:53.3527552Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:53.3530621Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.3532894Z 2025-05-07T20:03:53.3587803Z GLIBCXX_3.4 2025-05-07T20:03:53.3588478Z GLIBCXX_3.4.9 2025-05-07T20:03:53.3589096Z GLIBCXX_3.4.20 2025-05-07T20:03:53.3589679Z GLIBCXX_3.4.21 2025-05-07T20:03:53.3590029Z 2025-05-07T20:03:53.3590042Z 2025-05-07T20:03:53.3604510Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.7QuWx8x8QQ.symbols.txt 2025-05-07T20:03:53.3605987Z 2025-05-07T20:03:53.3633946Z 2025-05-07T20:03:53.3658935Z [CHECK] Total Number of symbols: 501 2025-05-07T20:03:53.3675874Z [CHECK] Number of fbgemm symbols: 13 2025-05-07T20:03:53.3699743Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so > /tmp/tmp.H32t5b3G0q.usymbols.txt 2025-05-07T20:03:53.3700259Z 2025-05-07T20:03:53.3715171Z 2025-05-07T20:03:53.3738261Z [CHECK] Listing out undefined symbols (154 total): 2025-05-07T20:03:53.3755172Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.3756976Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:53.3757993Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:53.3759208Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:53.3760150Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:53.3760547Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:53.3760933Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:53.3761302Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:53.3761681Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:53.3762047Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:53.3762383Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:53.3762690Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:53.3763010Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:53.3763320Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:53.3763640Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:53.3763960Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:53.3764285Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:53.3764593Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:53.3764921Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:53.3765563Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:53.3766092Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:53.3766999Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.3768381Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.3769494Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:53.3770063Z U at::_ops::mul_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:53.3770565Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:53.3771046Z U at::_ops::sub__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:53.3771487Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:53.3772188Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.3773309Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.3774080Z U c10::BoolType::get() 2025-05-07T20:03:53.3774429Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:53.3774845Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:53.3775205Z U c10::IntType::get() 2025-05-07T20:03:53.3775555Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:53.3775958Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:53.3776387Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:53.3776872Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:53.3777272Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:53.3777914Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:53.3778557Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:53.3778906Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:53.3779268Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:53.3779587Z U c10::SymIntType::get() 2025-05-07T20:03:53.3779966Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:53.3780402Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:53.3780767Z U c10::TensorType::get() 2025-05-07T20:03:53.3781110Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:53.3782021Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:53.3782963Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:53.3783342Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:53.3783684Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:53.3784046Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:53.3784383Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:53.3784772Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:53.3785264Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:53.3785735Z U c10::cuda::current_device() 2025-05-07T20:03:53.3786074Z U c10::cuda::device_count() 2025-05-07T20:03:53.3786394Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:53.3786767Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:53.3787131Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:53.3787566Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:53.3787961Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:53.3788323Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:53.3789069Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:53.3789901Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:53.3790748Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.3791657Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:53.3792927Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.3794006Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:53.3794377Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:53.3794754Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:53.3795212Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:53.3795617Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:53.3796006Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:53.3796416Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:53.3796781Z U c10::throwNullDataPtrError() 2025-05-07T20:03:53.3797129Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:53.3797457Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:53.3797891Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:53.3798343Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:53.3798710Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:53.3799102Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.3799490Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:53.3799877Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:53.3800236Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:53.3800608Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:53.3800958Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:53.3801329Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.3801714Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:53.3802072Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:53.3802438Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:53.3802813Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:53.3803203Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:53.3803556Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:53.3803954Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:53.3804318Z U cudaMemsetAsync@libcudart.so.11.0 2025-05-07T20:03:53.3804847Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 2025-05-07T20:03:53.3805404Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:53.3805756Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:53.3806116Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:53.3806473Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.3806921Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:53.3807413Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.3807884Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.3808258Z U log2@GLIBC_2.2.5 2025-05-07T20:03:53.3808608Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:53.3809027Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.3809401Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.3809775Z U memcpy@GLIBC_2.14 2025-05-07T20:03:53.3810062Z U memset@GLIBC_2.2.5 2025-05-07T20:03:53.3810350Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:53.3810690Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:53.3811001Z U printf@GLIBC_2.2.5 2025-05-07T20:03:53.3811285Z U puts@GLIBC_2.2.5 2025-05-07T20:03:53.3811790Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:53.3812604Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:53.3813641Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3814632Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.3815622Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3816493Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3817414Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:53.3818453Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.3819192Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.3819567Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.3819980Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:53.3820490Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:53.3821369Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.3822158Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:53.3822504Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:53.3822878Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:53.3823227Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:53.3823548Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:53.3824179Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.3824724Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.3839110Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:53.3839585Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:53.3840021Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:53.3840333Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:53.3841173Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:53.3842396Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:53.3843277Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:53.3844023Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:53.3844632Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:53.3845025Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:53.3845443Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:53.3845877Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:53.3846496Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.3847166Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:53.3847624Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:53.3847940Z w _ITM_registerTMCloneTable 2025-05-07T20:03:53.3848255Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:53.3848544Z w __gmon_start__ 2025-05-07T20:03:53.3848818Z w __pthread_key_create 2025-05-07T20:03:53.3849157Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:53.3849613Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:53.3849927Z 2025-05-07T20:03:53.3850080Z linux-vdso.so.1 (0x00007ffcaa3df000) 2025-05-07T20:03:53.3850372Z libtorch.so => not found 2025-05-07T20:03:53.3850623Z libc10.so => not found 2025-05-07T20:03:53.3850867Z libc10_cuda.so => not found 2025-05-07T20:03:53.3851125Z libtorch_cpu.so => not found 2025-05-07T20:03:53.3851400Z libtorch_cuda.so => not found 2025-05-07T20:03:53.3851662Z libcudart.so.11.0 => not found 2025-05-07T20:03:53.3852001Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fb842b9c000) 2025-05-07T20:03:53.3852390Z libm.so.6 => /lib64/libm.so.6 (0x00007fb842ac1000) 2025-05-07T20:03:53.3852778Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fb8436f3000) 2025-05-07T20:03:53.3853271Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fb8436c5000) 2025-05-07T20:03:53.3853630Z libc.so.6 => /lib64/libc.so.6 (0x00007fb8428b9000) 2025-05-07T20:03:53.3853966Z /lib64/ld-linux-x86-64.so.2 (0x00007fb84374f000) 2025-05-07T20:03:53.3854185Z 2025-05-07T20:03:53.3854284Z [CHECK] Displaying ELF information: 2025-05-07T20:03:53.3854682Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so 2025-05-07T20:03:53.3854994Z 2025-05-07T20:03:53.3854998Z 2025-05-07T20:03:53.3855142Z Dynamic section at offset 0x7de050 contains 37 entries: 2025-05-07T20:03:53.3855497Z Tag Type Name/Value 2025-05-07T20:03:53.3855936Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:53.3856398Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:53.3856874Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:53.3857351Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:53.3857844Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:53.3858329Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:53.3858844Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:53.3859317Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:53.3859770Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:53.3861643Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:53.3862104Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:53.3862595Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:53.3863041Z 0x000000000000000c (INIT) 0x14000 2025-05-07T20:03:53.3863352Z 0x000000000000000d (FINI) 0x5fb3c 2025-05-07T20:03:53.3863665Z 0x0000000000000019 (INIT_ARRAY) 0x7dd548 2025-05-07T20:03:53.3863977Z 0x000000000000001b (INIT_ARRAYSZ) 96 (bytes) 2025-05-07T20:03:53.3864300Z 0x000000000000001a (FINI_ARRAY) 0x7dd5a8 2025-05-07T20:03:53.3864615Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:53.3864936Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:53.3865233Z 0x0000000000000005 (STRTAB) 0x4240 2025-05-07T20:03:53.3865533Z 0x0000000000000006 (SYMTAB) 0x1330 2025-05-07T20:03:53.3865850Z 0x000000000000000a (STRSZ) 43494 (bytes) 2025-05-07T20:03:53.3866188Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:53.3866506Z 0x0000000000000003 (PLTGOT) 0x7de2f0 2025-05-07T20:03:53.3866828Z 0x0000000000000002 (PLTRELSZ) 6432 (bytes) 2025-05-07T20:03:53.3867152Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:53.3867443Z 0x0000000000000017 (JMPREL) 0x11f88 2025-05-07T20:03:53.3867748Z 0x0000000000000007 (RELA) 0xf108 2025-05-07T20:03:53.3868058Z 0x0000000000000008 (RELASZ) 11904 (bytes) 2025-05-07T20:03:53.3868389Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:53.3868682Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:53.3868985Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:53.3869306Z 0x000000006ffffffe (VERNEED) 0xf018 2025-05-07T20:03:53.3869605Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:53.3869910Z 0x000000006ffffff0 (VERSYM) 0xec26 2025-05-07T20:03:53.3870208Z 0x000000006ffffff9 (RELACOUNT) 116 2025-05-07T20:03:53.3870495Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:53.3870679Z 2025-05-07T20:03:53.3870784Z ################################################################################ 2025-05-07T20:03:53.3871006Z 2025-05-07T20:03:53.3871010Z 2025-05-07T20:03:53.3871116Z ################################################################################ 2025-05-07T20:03:53.3871616Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:53.3872091Z [CHECK] Listing out library size: 2025-05-07T20:03:53.3872544Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:53.3873191Z 2025-05-07T20:03:53.3873414Z 1 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:53.3873766Z 2025-05-07T20:03:53.3874228Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:53.3875301Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.3875933Z 2025-05-07T20:03:53.3890440Z GLIBC_2.2.5 2025-05-07T20:03:53.3891099Z GLIBC_2.14 2025-05-07T20:03:53.3891432Z 2025-05-07T20:03:53.3891445Z 2025-05-07T20:03:53.3892837Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:53.3893934Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.3894709Z 2025-05-07T20:03:53.3940827Z GLIBCXX_3.4 2025-05-07T20:03:53.3941468Z GLIBCXX_3.4.9 2025-05-07T20:03:53.3942360Z GLIBCXX_3.4.21 2025-05-07T20:03:53.3942738Z 2025-05-07T20:03:53.3942752Z 2025-05-07T20:03:53.3959611Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.GmZVs2cKC1.symbols.txt 2025-05-07T20:03:53.3961112Z 2025-05-07T20:03:53.3980260Z 2025-05-07T20:03:53.4002651Z [CHECK] Total Number of symbols: 274 2025-05-07T20:03:53.4017100Z [CHECK] Number of fbgemm symbols: 44 2025-05-07T20:03:53.4034879Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so > /tmp/tmp.WJNtrHO68B.usymbols.txt 2025-05-07T20:03:53.4036433Z 2025-05-07T20:03:53.4052593Z 2025-05-07T20:03:53.4075131Z [CHECK] Listing out undefined symbols (130 total): 2025-05-07T20:03:53.4090657Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.4092448Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:53.4093179Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:53.4093589Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:53.4093995Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:53.4094383Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:53.4094762Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:53.4095125Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:53.4095486Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:53.4095840Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:53.4096145Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:53.4096463Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:53.4096776Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:53.4097102Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:53.4097425Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:53.4097834Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:03:53.4098744Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.4100106Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.4101151Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:53.4101558Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:53.4102246Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.4102939Z U at::get_thread_num() 2025-05-07T20:03:53.4103251Z U at::internal::set_thread_num(int) 2025-05-07T20:03:53.4104270Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.4105193Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:03:53.4105733Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.4106152Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:53.4106593Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.4106967Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:53.4107329Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:53.4107699Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:53.4108057Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:53.4108450Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:53.4108785Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:53.4109199Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:53.4109585Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:53.4109937Z U c10::TensorType::get() 2025-05-07T20:03:53.4110244Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:53.4111126Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:53.4112036Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:53.4112376Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:53.4112813Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:53.4113317Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:53.4113649Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:53.4114083Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:53.4114552Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:53.4115030Z U c10::cuda::device_count() 2025-05-07T20:03:53.4115378Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:53.4115757Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:53.4116151Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:53.4116536Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:53.4116952Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:53.4117337Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:53.4118077Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:53.4118972Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:53.4119841Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.4120798Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:53.4121370Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:53.4121694Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:53.4122070Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:53.4122530Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:53.4122937Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:53.4123312Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:53.4123705Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:53.4124069Z U c10::throwNullDataPtrError() 2025-05-07T20:03:53.4124397Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:53.4124716Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:53.4125158Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:53.4125666Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:53.4126000Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:53.4126370Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.4126712Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:53.4127049Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:53.4127369Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:53.4127738Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:53.4128223Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:53.4128570Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.4128928Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:53.4129261Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:53.4129603Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:53.4129937Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:53.4130299Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:53.4130799Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:53.4131483Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 2025-05-07T20:03:53.4132019Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:53.4132138Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:53.4132259Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:53.4132389Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.4132520Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:53.4132664Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.4132795Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.4132981Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:53.4133113Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.4133211Z U memcpy@GLIBC_2.14 2025-05-07T20:03:53.4133321Z U memset@GLIBC_2.2.5 2025-05-07T20:03:53.4133443Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:53.4133565Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:53.4133667Z U printf@GLIBC_2.2.5 2025-05-07T20:03:53.4133774Z U puts@GLIBC_2.2.5 2025-05-07T20:03:53.4134127Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:53.4134527Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:53.4135093Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.4135572Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.4136115Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.4136263Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.4136405Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.4136670Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:53.4137257Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.4137417Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:53.4137583Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:53.4137706Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:53.4137818Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:53.4138024Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.4138174Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:53.4138270Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:53.4138409Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:53.4139004Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:53.4139477Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:53.4139745Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:53.4140112Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:53.4140270Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:53.4140446Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:53.4140599Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:53.4140937Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.4141186Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:53.4141295Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:53.4141400Z w _ITM_registerTMCloneTable 2025-05-07T20:03:53.4141510Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:53.4141597Z w __gmon_start__ 2025-05-07T20:03:53.4141690Z w __pthread_key_create 2025-05-07T20:03:53.4141839Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:53.4142076Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:53.4142084Z 2025-05-07T20:03:53.4142207Z linux-vdso.so.1 (0x00007fffed7fe000) 2025-05-07T20:03:53.4142306Z libc10.so => not found 2025-05-07T20:03:53.4142397Z libc10_cuda.so => not found 2025-05-07T20:03:53.4142484Z libtorch.so => not found 2025-05-07T20:03:53.4142573Z libtorch_cpu.so => not found 2025-05-07T20:03:53.4142671Z libtorch_cuda.so => not found 2025-05-07T20:03:53.4142761Z libcudart.so.11.0 => not found 2025-05-07T20:03:53.4142922Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fdf7eb30000) 2025-05-07T20:03:53.4143079Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fdf7eada000) 2025-05-07T20:03:53.4143230Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fdf7eaac000) 2025-05-07T20:03:53.4143378Z libc.so.6 => /lib64/libc.so.6 (0x00007fdf7e8a4000) 2025-05-07T20:03:53.4143503Z libm.so.6 => /lib64/libm.so.6 (0x00007fdf7e7c9000) 2025-05-07T20:03:53.4143639Z /lib64/ld-linux-x86-64.so.2 (0x00007fdf7ee92000) 2025-05-07T20:03:53.4143644Z 2025-05-07T20:03:53.4143750Z [CHECK] Displaying ELF information: 2025-05-07T20:03:53.4144013Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so 2025-05-07T20:03:53.4144018Z 2025-05-07T20:03:53.4160864Z 2025-05-07T20:03:53.4161488Z Dynamic section at offset 0xc06b8 contains 37 entries: 2025-05-07T20:03:53.4161872Z Tag Type Name/Value 2025-05-07T20:03:53.4162274Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:53.4162470Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:53.4162664Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:53.4162899Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:53.4163102Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:53.4163311Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:53.4163531Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:53.4163732Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:53.4163924Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:53.4164109Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:53.4164367Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:53.4164546Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:53.4164658Z 0x000000000000000c (INIT) 0xa000 2025-05-07T20:03:53.4164773Z 0x000000000000000d (FINI) 0x1813c 2025-05-07T20:03:53.4164887Z 0x0000000000000019 (INIT_ARRAY) 0xc13b0 2025-05-07T20:03:53.4165005Z 0x000000000000001b (INIT_ARRAYSZ) 32 (bytes) 2025-05-07T20:03:53.4165124Z 0x000000000000001a (FINI_ARRAY) 0xc13d0 2025-05-07T20:03:53.4165244Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:53.4165351Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:53.4165454Z 0x0000000000000005 (STRTAB) 0x22f0 2025-05-07T20:03:53.4165567Z 0x0000000000000006 (SYMTAB) 0x928 2025-05-07T20:03:53.4165694Z 0x000000000000000a (STRSZ) 20379 (bytes) 2025-05-07T20:03:53.4165813Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:53.4165929Z 0x0000000000000003 (PLTGOT) 0xc1948 2025-05-07T20:03:53.4166056Z 0x0000000000000002 (PLTRELSZ) 3936 (bytes) 2025-05-07T20:03:53.4166159Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:53.4166264Z 0x0000000000000017 (JMPREL) 0x8298 2025-05-07T20:03:53.4166379Z 0x0000000000000007 (RELA) 0x7578 2025-05-07T20:03:53.4166502Z 0x0000000000000008 (RELASZ) 3360 (bytes) 2025-05-07T20:03:53.4166621Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:53.4166727Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:53.4166848Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:53.4166956Z 0x000000006ffffffe (VERNEED) 0x74b8 2025-05-07T20:03:53.4167070Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:53.4167180Z 0x000000006ffffff0 (VERSYM) 0x728c 2025-05-07T20:03:53.4167279Z 0x000000006ffffff9 (RELACOUNT) 7 2025-05-07T20:03:53.4167378Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:53.4167404Z 2025-05-07T20:03:53.4167517Z ################################################################################ 2025-05-07T20:03:53.4167523Z 2025-05-07T20:03:53.4167527Z 2025-05-07T20:03:53.4167683Z ################################################################################ 2025-05-07T20:03:53.4168025Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:53.4168126Z [CHECK] Listing out library size: 2025-05-07T20:03:53.4168440Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:53.4168444Z 2025-05-07T20:03:53.4175336Z 11 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:53.4175561Z 2025-05-07T20:03:53.4177476Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:53.4179204Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.4179318Z 2025-05-07T20:03:53.4635118Z GLIBC_2.2.5 2025-05-07T20:03:53.4635371Z GLIBC_2.3 2025-05-07T20:03:53.4635618Z GLIBC_2.14 2025-05-07T20:03:53.4635636Z 2025-05-07T20:03:53.4635660Z 2025-05-07T20:03:53.4637356Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:53.4639073Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.4639090Z 2025-05-07T20:03:53.5094320Z GLIBCXX_3.4 2025-05-07T20:03:53.5094948Z GLIBCXX_3.4.9 2025-05-07T20:03:53.5095082Z GLIBCXX_3.4.11 2025-05-07T20:03:53.5095205Z GLIBCXX_3.4.15 2025-05-07T20:03:53.5095303Z GLIBCXX_3.4.18 2025-05-07T20:03:53.5095390Z GLIBCXX_3.4.20 2025-05-07T20:03:53.5095474Z GLIBCXX_3.4.21 2025-05-07T20:03:53.5095480Z 2025-05-07T20:03:53.5095525Z 2025-05-07T20:03:53.5115802Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.c8juHKjPSR.symbols.txt 2025-05-07T20:03:53.5115876Z 2025-05-07T20:03:53.5520867Z 2025-05-07T20:03:53.5549829Z [CHECK] Total Number of symbols: 4395 2025-05-07T20:03:53.5578800Z [CHECK] Number of fbgemm symbols: 4 2025-05-07T20:03:53.5595927Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so > /tmp/tmp.AyL0aCUeMw.usymbols.txt 2025-05-07T20:03:53.5595978Z 2025-05-07T20:03:53.5623411Z 2025-05-07T20:03:53.5649110Z [CHECK] Listing out undefined symbols (192 total): 2025-05-07T20:03:53.5666400Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.5667630Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.5667931Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:53.5668287Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:53.5668589Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:53.5668906Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:53.5669201Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:53.5669506Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:53.5669818Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:53.5670117Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:53.5670413Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:53.5670707Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:53.5670949Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:53.5671064Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:53.5671163Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:53.5671356Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:53.5671492Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:53.5671602Z U at::RecordFunction::end() 2025-05-07T20:03:53.5671950Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:53.5672111Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:53.5672427Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:53.5672920Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:53.5673256Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:03:53.5673587Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:03:53.5674272Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.5674524Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:53.5674701Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:53.5674873Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:53.5675046Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:53.5675188Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:53.5675284Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:53.5675388Z U c10::AnyType::get() 2025-05-07T20:03:53.5675488Z U c10::BoolType::get() 2025-05-07T20:03:53.5675662Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:53.5675849Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:53.5675970Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:53.5676515Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:53.5677172Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:53.5677571Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:53.5677674Z U c10::Error::what() const 2025-05-07T20:03:53.5677781Z U c10::FloatType::get() 2025-05-07T20:03:53.5677891Z U c10::GradMode::is_enabled() 2025-05-07T20:03:53.5678015Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:53.5678178Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:53.5678297Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:53.5678423Z U c10::IValue::isBoolList() const 2025-05-07T20:03:53.5678539Z U c10::IValue::isDoubleList() const 2025-05-07T20:03:53.5678650Z U c10::IValue::isIntList() const 2025-05-07T20:03:53.5678780Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:53.5678892Z U c10::IValue::isTensorList() const 2025-05-07T20:03:53.5679039Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:53.5679150Z U c10::IntType::get() 2025-05-07T20:03:53.5679638Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:53.5679815Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:53.5679955Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:53.5680115Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:53.5680242Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:53.5680487Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:53.5680777Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:53.5680884Z U c10::StringType::get() 2025-05-07T20:03:53.5681060Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:53.5681213Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:53.5681405Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:53.5681569Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:03:53.5682014Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:53.5682196Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:53.5682337Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:03:53.5682529Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:53.5682662Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:53.5682807Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:53.5682945Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:53.5683059Z U c10::SymIntType::get() 2025-05-07T20:03:53.5683193Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:53.5683323Z U c10::TensorType::get() 2025-05-07T20:03:53.5683452Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:53.5683897Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:53.5684451Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:53.5684718Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:53.5685348Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.5685693Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:53.5686253Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.5686581Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:53.5686763Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:53.5686884Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:53.5687052Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:53.5687404Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:53.5687525Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:53.5687700Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:53.5687849Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:03:53.5687989Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:53.5688196Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:53.5688409Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:53.5688662Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:53.5688956Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:53.5689052Z U free@GLIBC_2.2.5 2025-05-07T20:03:53.5689226Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:53.5689324Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:53.5689437Z U memcpy@GLIBC_2.14 2025-05-07T20:03:53.5689560Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:53.5689654Z U memset@GLIBC_2.2.5 2025-05-07T20:03:53.5689786Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:53.5689930Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:53.5690027Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:53.5690251Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:53.5690608Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:53.5690986Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:53.5691390Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.5691912Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.5692282Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:53.5692689Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.5693181Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.5693537Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.5694074Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:53.5694397Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:53.5694784Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:53.5695141Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:53.5695256Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:53.5695386Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:53.5695528Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.5695666Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.5695856Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:53.5695989Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:53.5696128Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:53.5696404Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:53.5696962Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.5697089Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:53.5697226Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:53.5697348Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:53.5697463Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:53.5697626Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:53.5697807Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.5698039Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.5699428Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:53.5699594Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:53.5699729Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:53.5700198Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:53.5700338Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:53.5700451Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:53.5700570Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:53.5700664Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:53.5700786Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:53.5701372Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:53.5701820Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:53.5702070Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:53.5702209Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:53.5702494Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:53.5702672Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:53.5702889Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:53.5703071Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:53.5703410Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:53.5703573Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:53.5703760Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:53.5703934Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:53.5704073Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:53.5704186Z U torch::autograd::Node::metadata() 2025-05-07T20:03:53.5704321Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:53.5704577Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:53.5704838Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:53.5704978Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:53.5705227Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:53.5705443Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:53.5708005Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:53.5708209Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:53.5708372Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:53.5708558Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:53.5709312Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:03:53.5709482Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:53.5709883Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:53.5710234Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:53.5710357Z U typeinfo for c10::Error 2025-05-07T20:03:53.5710496Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:53.5710623Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:53.5710765Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:53.5710895Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:53.5711013Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:53.5711176Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:53.5711337Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:53.5711490Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:53.5711662Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:53.5711765Z U vtable for c10::Error 2025-05-07T20:03:53.5712274Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.5712432Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:53.5712776Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:53.5712899Z U vtable for torch::autograd::Node 2025-05-07T20:03:53.5713284Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:53.5713411Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:53.5713524Z w _ITM_registerTMCloneTable 2025-05-07T20:03:53.5713632Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:53.5713749Z w __gmon_start__ 2025-05-07T20:03:53.5713856Z w __pthread_key_create 2025-05-07T20:03:53.5714045Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:53.5714220Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:53.5714374Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:53.5714654Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:53.5714662Z 2025-05-07T20:03:53.5714836Z linux-vdso.so.1 (0x00007ffc6cdc1000) 2025-05-07T20:03:53.5714934Z libc10.so => not found 2025-05-07T20:03:53.5715433Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f435e60a000) 2025-05-07T20:03:53.5715585Z libtorch.so => not found 2025-05-07T20:03:53.5716041Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f435f3cb000) 2025-05-07T20:03:53.5716146Z libtorch_cpu.so => not found 2025-05-07T20:03:53.5716301Z libtorch_cuda.so => not found 2025-05-07T20:03:53.5716468Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f435e3a6000) 2025-05-07T20:03:53.5716625Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f435f39b000) 2025-05-07T20:03:53.5716756Z libc.so.6 => /lib64/libc.so.6 (0x00007f435e19e000) 2025-05-07T20:03:53.5716951Z /lib64/ld-linux-x86-64.so.2 (0x00007f435f3da000) 2025-05-07T20:03:53.5717046Z libc10.so => not found 2025-05-07T20:03:53.5717151Z libc10_cuda.so => not found 2025-05-07T20:03:53.5717540Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f435dc00000) 2025-05-07T20:03:53.5717639Z libtorch.so => not found 2025-05-07T20:03:53.5717741Z libtorch_cpu.so => not found 2025-05-07T20:03:53.5717863Z libtorch_cuda.so => not found 2025-05-07T20:03:53.5717966Z libcudart.so.11.0 => not found 2025-05-07T20:03:53.5718134Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f435f343000) 2025-05-07T20:03:53.5718223Z libtorch.so => not found 2025-05-07T20:03:53.5718327Z libc10.so => not found 2025-05-07T20:03:53.5718422Z libtorch_cpu.so => not found 2025-05-07T20:03:53.5718518Z libtorch_cuda.so => not found 2025-05-07T20:03:53.5718702Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f435f33c000) 2025-05-07T20:03:53.5718828Z libm.so.6 => /lib64/libm.so.6 (0x00007f435f25f000) 2025-05-07T20:03:53.5718922Z libc10.so => not found 2025-05-07T20:03:53.5719281Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f435db88000) 2025-05-07T20:03:53.5719390Z libtorch.so => not found 2025-05-07T20:03:53.5719490Z libtorch_cpu.so => not found 2025-05-07T20:03:53.5719585Z libtorch_cuda.so => not found 2025-05-07T20:03:53.5719696Z libtorch_cpu.so => not found 2025-05-07T20:03:53.5719795Z libtorch_cuda.so => not found 2025-05-07T20:03:53.5719884Z libtorch.so => not found 2025-05-07T20:03:53.5720016Z librt.so.1 => /lib64/librt.so.1 (0x00007f435f258000) 2025-05-07T20:03:53.5720045Z 2025-05-07T20:03:53.5720159Z [CHECK] Displaying ELF information: 2025-05-07T20:03:53.5720452Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so 2025-05-07T20:03:53.5720457Z 2025-05-07T20:03:53.5755147Z 2025-05-07T20:03:53.5756203Z Dynamic section at offset 0xa3d920 contains 37 entries: 2025-05-07T20:03:53.5756367Z Tag Type Name/Value 2025-05-07T20:03:53.5756568Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:53.5756811Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:53.5757011Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:53.5757231Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:03:53.5757451Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:53.5757653Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:53.5757858Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:53.5758244Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:53.5758466Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:53.5758688Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:53.5758980Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_pt2.so] 2025-05-07T20:03:53.5759165Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:53.5759283Z 0x000000000000000c (INIT) 0x189000 2025-05-07T20:03:53.5759403Z 0x000000000000000d (FINI) 0x8a73b8 2025-05-07T20:03:53.5759579Z 0x0000000000000019 (INIT_ARRAY) 0xa32f68 2025-05-07T20:03:53.5759709Z 0x000000000000001b (INIT_ARRAYSZ) 256 (bytes) 2025-05-07T20:03:53.5759823Z 0x000000000000001a (FINI_ARRAY) 0xa33068 2025-05-07T20:03:53.5760002Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:53.5760120Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:53.5760229Z 0x0000000000000005 (STRTAB) 0x20fc8 2025-05-07T20:03:53.5760348Z 0x0000000000000006 (SYMTAB) 0x73a8 2025-05-07T20:03:53.5760534Z 0x000000000000000a (STRSZ) 1247927 (bytes) 2025-05-07T20:03:53.5760655Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:53.5760776Z 0x0000000000000003 (PLTGOT) 0xa3ebb0 2025-05-07T20:03:53.5760910Z 0x0000000000000002 (PLTRELSZ) 42648 (bytes) 2025-05-07T20:03:53.5761015Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:53.5761131Z 0x0000000000000017 (JMPREL) 0x17dc38 2025-05-07T20:03:53.5761249Z 0x0000000000000007 (RELA) 0x153de8 2025-05-07T20:03:53.5761384Z 0x0000000000000008 (RELASZ) 171600 (bytes) 2025-05-07T20:03:53.5761499Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:53.5761607Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:53.5761733Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:53.5761850Z 0x000000006ffffffe (VERNEED) 0x153cd8 2025-05-07T20:03:53.5761964Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:53.5762079Z 0x000000006ffffff0 (VERSYM) 0x151a80 2025-05-07T20:03:53.5762183Z 0x000000006ffffff9 (RELACOUNT) 34 2025-05-07T20:03:53.5762279Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:53.5762285Z 2025-05-07T20:03:53.5762409Z ################################################################################ 2025-05-07T20:03:53.5762414Z 2025-05-07T20:03:53.5762420Z 2025-05-07T20:03:53.5762531Z ################################################################################ 2025-05-07T20:03:53.5762814Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:53.5762929Z [CHECK] Listing out library size: 2025-05-07T20:03:53.5763203Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:53.5763208Z 2025-05-07T20:03:53.5767501Z 211 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:53.5767795Z 2025-05-07T20:03:53.5769975Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:53.5771186Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.5771194Z 2025-05-07T20:03:53.6160246Z GLIBC_2.2.5 2025-05-07T20:03:53.6160470Z GLIBC_2.14 2025-05-07T20:03:53.6160546Z 2025-05-07T20:03:53.6160551Z 2025-05-07T20:03:53.6161028Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:53.6161660Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.6161852Z 2025-05-07T20:03:53.6556994Z GLIBCXX_3.4 2025-05-07T20:03:53.6557299Z GLIBCXX_3.4.9 2025-05-07T20:03:53.6557533Z GLIBCXX_3.4.11 2025-05-07T20:03:53.6557758Z GLIBCXX_3.4.14 2025-05-07T20:03:53.6558007Z GLIBCXX_3.4.18 2025-05-07T20:03:53.6558242Z GLIBCXX_3.4.20 2025-05-07T20:03:53.6558460Z GLIBCXX_3.4.21 2025-05-07T20:03:53.6558490Z 2025-05-07T20:03:53.6558503Z 2025-05-07T20:03:53.6579965Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.MBWO4lTdAr.symbols.txt 2025-05-07T20:03:53.6580012Z 2025-05-07T20:03:53.6939472Z 2025-05-07T20:03:53.6964693Z [CHECK] Total Number of symbols: 5040 2025-05-07T20:03:53.6993181Z [CHECK] Number of fbgemm symbols: 3788 2025-05-07T20:03:53.7010873Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so > /tmp/tmp.CS9ASr2vfs.usymbols.txt 2025-05-07T20:03:53.7011252Z 2025-05-07T20:03:53.7039074Z 2025-05-07T20:03:53.7063720Z [CHECK] Listing out undefined symbols (253 total): 2025-05-07T20:03:53.7081125Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.7082042Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.7082596Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:53.7082949Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:53.7083374Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:53.7083780Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:53.7084179Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:53.7084571Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:53.7084926Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:53.7085317Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:53.7085747Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:53.7086071Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:53.7086389Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:53.7086720Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:53.7087036Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:53.7087376Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:53.7087694Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:53.7088025Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:53.7088340Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:53.7088639Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:53.7088944Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:53.7089361Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:53.7090244Z U at::_ops::arange_start::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.7091499Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.7092869Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.7093823Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:53.7094604Z U at::_ops::scalar_tensor::call(c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.7095446Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:53.7096063Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:53.7097243Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:53.7098407Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.7099107Z U at::detail::getCUDAHooks() 2025-05-07T20:03:53.7099398Z U at::detail::getHIPHooks() 2025-05-07T20:03:53.7099683Z U at::get_thread_num() 2025-05-07T20:03:53.7100071Z U at::globalContext() 2025-05-07T20:03:53.7100370Z U at::internal::set_thread_num(int) 2025-05-07T20:03:53.7100746Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:03:53.7101176Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.7101671Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.7102100Z U c10::ClassType::addMethod(torch::jit::Function*) 2025-05-07T20:03:53.7102692Z U c10::ClassType::getMethod(std::__cxx11::basic_string, std::allocator > const&) const 2025-05-07T20:03:53.7103303Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:53.7104174Z U c10::DictType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:53.7105258Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:53.7105816Z U c10::Error::what() const 2025-05-07T20:03:53.7106104Z U c10::GradMode::is_enabled() 2025-05-07T20:03:53.7106408Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:53.7106745Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.7107161Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.7107581Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:53.7107946Z U c10::IValue::is(c10::IValue const&) const 2025-05-07T20:03:53.7108277Z U c10::IValue::isTensorList() const 2025-05-07T20:03:53.7108610Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:53.7108946Z U c10::IntType::get() 2025-05-07T20:03:53.7109593Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:53.7110299Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:53.7110683Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:53.7110983Z U c10::NoneType::get() 2025-05-07T20:03:53.7111381Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:53.7111814Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:53.7112157Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:53.7112529Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:53.7112983Z U c10::StringType::get() 2025-05-07T20:03:53.7113513Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:53.7114024Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:53.7114709Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:53.7115376Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:53.7115743Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:53.7116129Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:53.7116838Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:03:53.7117535Z U c10::TensorType::get() 2025-05-07T20:03:53.7118563Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:03:53.7119626Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:53.7120629Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:53.7121701Z U c10::_fastEqualsForContainer(c10::IValue const&, c10::IValue const&) 2025-05-07T20:03:53.7122173Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:53.7122553Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:53.7122902Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:53.7123254Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:53.7123603Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:53.7123948Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:53.7124433Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:53.7124903Z U c10::cuda::device_count() 2025-05-07T20:03:53.7125258Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:53.7125752Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:53.7126109Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:53.7126483Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:53.7126853Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:53.7127222Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:53.7127830Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:53.7128831Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:53.7130395Z U c10::detail::infer_schema::make_function_schema(std::__cxx11::basic_string, std::allocator >&&, std::__cxx11::basic_string, std::allocator >&&, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:53.7132233Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:53.7133110Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.7134081Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:53.7135207Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.7136082Z U c10::getCustomClassTypeImpl(std::type_index const&) 2025-05-07T20:03:53.7136469Z U c10::get_default_dtype() 2025-05-07T20:03:53.7136973Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:53.7137563Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:03:53.7138005Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:53.7138381Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:53.7138729Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:53.7139329Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:53.7140028Z U c10::ivalue::Future::extractStorages(c10::IValue const&) 2025-05-07T20:03:53.7140459Z U c10::ivalue::Object::resizeObject(unsigned long) 2025-05-07T20:03:53.7140992Z U c10::ivalue::checkCustomClassType(c10::ClassType const*, c10::Type const*) 2025-05-07T20:03:53.7141503Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:53.7141894Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:53.7142311Z U c10::operator<<(std::ostream&, c10::FunctionSchema const&) 2025-05-07T20:03:53.7142708Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:53.7143107Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:53.7143548Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:53.7143902Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.7144286Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:53.7144654Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:53.7144996Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:53.7145353Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:53.7145684Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:53.7146038Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.7146389Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:53.7146742Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:53.7147090Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:53.7147433Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:53.7147795Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:53.7148154Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:53.7149170Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7150905Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7152774Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7154548Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7156359Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7158198Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7159902Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:53.7161779Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:53.7163592Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7165527Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:53.7167279Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7168950Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:53.7170616Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:53.7172391Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7174027Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:53.7175592Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:53.7177263Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7178980Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:53.7180769Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7182782Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMFP8WithStrides(long, bool, bool, bool, long, long, int, int, bool) 2025-05-07T20:03:53.7184793Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMNBitWithStrides(int, long, bool, bool, int, bool, bool, long, long, bool, bool, bool, int) 2025-05-07T20:03:53.7186762Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7188722Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7190619Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7192571Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7194531Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7196449Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7198457Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:53.7199662Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.7200073Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.7200485Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.7200867Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.7201547Z U linearize_cache_indices_cuda(at::Tensor const&, at::Tensor const&, at::Tensor const&, std::optional const&, long, long) 2025-05-07T20:03:53.7202263Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:53.7202684Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.7203081Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.7203945Z U lru_cache_populate_byte_cuda(at::Tensor, at::Tensor, long, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long, bool, std::optional) 2025-05-07T20:03:53.7205081Z U lxu_cache_lookup_cuda(at::Tensor, at::Tensor, long, bool, std::optional, std::optional, std::optional) 2025-05-07T20:03:53.7205721Z U memcpy@GLIBC_2.14 2025-05-07T20:03:53.7206008Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:53.7206300Z U memset@GLIBC_2.2.5 2025-05-07T20:03:53.7208462Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:53.7208853Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:53.7209307Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:53.7209986Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:53.7211066Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:53.7211959Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.7212952Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:53.7213956Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:53.7214837Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.7215749Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:53.7216765Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.7217716Z U std::__cxx11::basic_string, std::allocator >::find(char, unsigned long) const@GLIBCXX_3.4.21 2025-05-07T20:03:53.7218511Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:53.7219261Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:53.7220366Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream(std::__cxx11::basic_string, std::allocator > const&, std::_Ios_Openmode)@GLIBCXX_3.4.21 2025-05-07T20:03:53.7221528Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:53.7222338Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:53.7222931Z U std::__exception_ptr::exception_ptr::_M_addref() 2025-05-07T20:03:53.7223294Z U std::__exception_ptr::exception_ptr::_M_release() 2025-05-07T20:03:53.7223643Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:53.7223968Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:53.7224294Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:53.7224662Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.7225022Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:53.7225426Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:53.7225815Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:53.7226278Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:53.7227177Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.7227988Z U std::condition_variable::condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:03:53.7228417Z U std::condition_variable::notify_all()@GLIBCXX_3.4.11 2025-05-07T20:03:53.7228838Z U std::condition_variable::~condition_variable()@GLIBCXX_3.4.11 2025-05-07T20:03:53.7229224Z U std::current_exception()@CXXABI_1.3.3 2025-05-07T20:03:53.7229562Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:53.7229890Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:53.7230250Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:53.7230568Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:53.7230967Z U std::istream& std::istream::_M_extract(long&)@GLIBCXX_3.4.9 2025-05-07T20:03:53.7231578Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:53.7232374Z U std::logic_error::logic_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:53.7233215Z U std::logic_error::~logic_error()@GLIBCXX_3.4 2025-05-07T20:03:53.7233654Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.7234195Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:53.7234683Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:53.7235083Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:53.7235506Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:53.7235985Z U std::rethrow_exception(std::__exception_ptr::exception_ptr)@CXXABI_1.3.3 2025-05-07T20:03:53.7236508Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:53.7236950Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:53.7237306Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:53.7237628Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:53.7237911Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:53.7238239Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:53.7239081Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:53.7240269Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:53.7241123Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:53.7242614Z U torch::detail::class_base::class_base(std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator > const&, std::__cxx11::basic_string, std::allocator >, std::type_info const&, std::type_info const&) 2025-05-07T20:03:53.7244167Z U torch::detail::class_base::withNewArguments(c10::FunctionSchema const&, std::initializer_list) 2025-05-07T20:03:53.7245118Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:53.7245928Z U torch::registerCustomClassMethod(std::unique_ptr >) 2025-05-07T20:03:53.7246534Z U torch::serialize::InputArchive::InputArchive() 2025-05-07T20:03:53.7247102Z U torch::serialize::InputArchive::load_from(char const*, unsigned long, std::optional) 2025-05-07T20:03:53.7247934Z U torch::serialize::InputArchive::read(std::__cxx11::basic_string, std::allocator > const&, at::Tensor&, bool) 2025-05-07T20:03:53.7248764Z U torch::serialize::OutputArchive::OutputArchive(std::shared_ptr) 2025-05-07T20:03:53.7249325Z U torch::serialize::OutputArchive::save_to(std::ostream&) 2025-05-07T20:03:53.7250045Z U torch::serialize::OutputArchive::write(std::__cxx11::basic_string, std::allocator > const&, at::Tensor const&, bool) 2025-05-07T20:03:53.7250737Z U typeinfo for c10::Error 2025-05-07T20:03:53.7251045Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:53.7251381Z U typeinfo for std::logic_error@GLIBCXX_3.4 2025-05-07T20:03:53.7251766Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:53.7252110Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:53.7252524Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:03:53.7253036Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:53.7253482Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:53.7253869Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:53.7254264Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:53.7254608Z U vtable for c10::Error 2025-05-07T20:03:53.7255122Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:53.7255740Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:53.7256181Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:03:53.7256500Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:53.7256799Z w _ITM_registerTMCloneTable 2025-05-07T20:03:53.7257096Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:53.7257366Z w __gmon_start__ 2025-05-07T20:03:53.7257776Z w __pthread_key_create 2025-05-07T20:03:53.7258052Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:53.7258357Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:53.7258696Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:53.7259140Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:53.7259440Z 2025-05-07T20:03:53.7259587Z linux-vdso.so.1 (0x00007ffe2f304000) 2025-05-07T20:03:53.7259845Z libc10.so => not found 2025-05-07T20:03:53.7260070Z libc10_cuda.so => not found 2025-05-07T20:03:53.7260558Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f72dfc00000) 2025-05-07T20:03:53.7261432Z fbgemm_gpu_tbe_cache.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so (0x00007f72dee00000) 2025-05-07T20:03:53.7262055Z libtorch.so => not found 2025-05-07T20:03:53.7262288Z libtorch_cpu.so => not found 2025-05-07T20:03:53.7262540Z libtorch_cuda.so => not found 2025-05-07T20:03:53.7262788Z libcudart.so.11.0 => not found 2025-05-07T20:03:53.7263098Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f72deb9c000) 2025-05-07T20:03:53.7263477Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f72eda5e000) 2025-05-07T20:03:53.7263839Z libc.so.6 => /lib64/libc.so.6 (0x00007f72de994000) 2025-05-07T20:03:53.7264137Z libc10.so => not found 2025-05-07T20:03:53.7264605Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f72ed9e4000) 2025-05-07T20:03:53.7265132Z libtorch.so => not found 2025-05-07T20:03:53.7265415Z libtorch_cpu.so => not found 2025-05-07T20:03:53.7265670Z libtorch_cuda.so => not found 2025-05-07T20:03:53.7265935Z libm.so.6 => /lib64/libm.so.6 (0x00007f72dfb25000) 2025-05-07T20:03:53.7266269Z /lib64/ld-linux-x86-64.so.2 (0x00007f72eda92000) 2025-05-07T20:03:53.7266561Z libtorch.so => not found 2025-05-07T20:03:53.7266795Z libc10.so => not found 2025-05-07T20:03:53.7267028Z libc10_cuda.so => not found 2025-05-07T20:03:53.7267261Z libtorch_cpu.so => not found 2025-05-07T20:03:53.7267510Z libtorch_cuda.so => not found 2025-05-07T20:03:53.7267753Z libcudart.so.11.0 => not found 2025-05-07T20:03:53.7268057Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f72ed98a000) 2025-05-07T20:03:53.7268404Z libtorch_cpu.so => not found 2025-05-07T20:03:53.7268653Z libtorch_cuda.so => not found 2025-05-07T20:03:53.7268893Z libtorch.so => not found 2025-05-07T20:03:53.7269161Z librt.so.1 => /lib64/librt.so.1 (0x00007f72ed983000) 2025-05-07T20:03:53.7269567Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f72ed97e000) 2025-05-07T20:03:53.7269844Z 2025-05-07T20:03:53.7269942Z [CHECK] Displaying ELF information: 2025-05-07T20:03:53.7270356Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so 2025-05-07T20:03:53.7270679Z 2025-05-07T20:03:53.7270707Z 2025-05-07T20:03:53.7270859Z Dynamic section at offset 0xd2d8688 contains 38 entries: 2025-05-07T20:03:53.7271213Z Tag Type Name/Value 2025-05-07T20:03:53.7271592Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:53.7272062Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:53.7272531Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:53.7273283Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_cache.so] 2025-05-07T20:03:53.7273823Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:53.7274335Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:53.7274855Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:53.7275376Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:53.7275906Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:53.7276418Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:53.7276909Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:53.7277437Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_inference.so] 2025-05-07T20:03:53.7277964Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:53.7278372Z 0x000000000000000c (INIT) 0x19c000 2025-05-07T20:03:53.7278699Z 0x000000000000000d (FINI) 0x73d58c 2025-05-07T20:03:53.7279045Z 0x0000000000000019 (INIT_ARRAY) 0xd2d69c0 2025-05-07T20:03:53.7279406Z 0x000000000000001b (INIT_ARRAYSZ) 392 (bytes) 2025-05-07T20:03:53.7279752Z 0x000000000000001a (FINI_ARRAY) 0xd2d6b48 2025-05-07T20:03:53.7280100Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:53.7280434Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:53.7280768Z 0x0000000000000005 (STRTAB) 0x25568 2025-05-07T20:03:53.7281089Z 0x0000000000000006 (SYMTAB) 0x7cd0 2025-05-07T20:03:53.7281446Z 0x000000000000000a (STRSZ) 1383267 (bytes) 2025-05-07T20:03:53.7281805Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:53.7282156Z 0x0000000000000003 (PLTGOT) 0xd2d8928 2025-05-07T20:03:53.7282518Z 0x0000000000000002 (PLTRELSZ) 20640 (bytes) 2025-05-07T20:03:53.7282858Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:53.7283186Z 0x0000000000000017 (JMPREL) 0x196378 2025-05-07T20:03:53.7283510Z 0x0000000000000007 (RELA) 0x179950 2025-05-07T20:03:53.7283901Z 0x0000000000000008 (RELASZ) 117288 (bytes) 2025-05-07T20:03:53.7284259Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:53.7284584Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:53.7284917Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:53.7285257Z 0x000000006ffffffe (VERNEED) 0x179830 2025-05-07T20:03:53.7285689Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:53.7285979Z 0x000000006ffffff0 (VERSYM) 0x1770cc 2025-05-07T20:03:53.7286289Z 0x000000006ffffff9 (RELACOUNT) 447 2025-05-07T20:03:53.7286596Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:53.7286791Z 2025-05-07T20:03:53.7286891Z ################################################################################ 2025-05-07T20:03:53.7287099Z 2025-05-07T20:03:53.7287103Z 2025-05-07T20:03:53.7287250Z ################################################################################ 2025-05-07T20:03:53.7287766Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:53.7288278Z [CHECK] Listing out library size: 2025-05-07T20:03:53.7288767Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:53.7289159Z 2025-05-07T20:03:53.7289385Z 188 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:53.7289708Z 2025-05-07T20:03:53.7290110Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:53.7291090Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.7291687Z 2025-05-07T20:03:53.8184992Z GLIBC_2.2.5 2025-05-07T20:03:53.8185648Z GLIBC_2.3 2025-05-07T20:03:53.8186218Z GLIBC_2.14 2025-05-07T20:03:53.8186546Z 2025-05-07T20:03:53.8186559Z 2025-05-07T20:03:53.8187927Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:53.8191197Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:53.8192559Z 2025-05-07T20:03:53.9138690Z GLIBCXX_3.4 2025-05-07T20:03:53.9139349Z GLIBCXX_3.4.9 2025-05-07T20:03:53.9139950Z GLIBCXX_3.4.20 2025-05-07T20:03:53.9140546Z GLIBCXX_3.4.21 2025-05-07T20:03:53.9140904Z 2025-05-07T20:03:53.9140917Z 2025-05-07T20:03:53.9157561Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.828L5YBOKF.symbols.txt 2025-05-07T20:03:53.9159113Z 2025-05-07T20:03:54.0082897Z 2025-05-07T20:03:54.0127663Z [CHECK] Total Number of symbols: 12561 2025-05-07T20:03:54.0177114Z [CHECK] Number of fbgemm symbols: 5267 2025-05-07T20:03:54.0195451Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so > /tmp/tmp.MJJLaGvOIp.usymbols.txt 2025-05-07T20:03:54.0197045Z 2025-05-07T20:03:54.0243384Z 2025-05-07T20:03:54.0269234Z [CHECK] Listing out undefined symbols (175 total): 2025-05-07T20:03:54.0282804Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.0283674Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:54.0284050Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.0284460Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.0284869Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.0285252Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:54.0285643Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:54.0286022Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:54.0286564Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.0286951Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:54.0287270Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:54.0287597Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:54.0287920Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:54.0288252Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:54.0288587Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:54.0288907Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:54.0289307Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:54.0289734Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:54.0290148Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:54.0290430Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:54.0290911Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:54.0291272Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:54.0291670Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:54.0292220Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:54.0292892Z U at::Tensor fbgemm_gpu::reshape_vbe_offsets(at::Tensor const&, at::Tensor const&, long, int) 2025-05-07T20:03:54.0293499Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:03:54.0294066Z U at::Tensor::index_put_(std::initializer_list, at::Tensor const&) 2025-05-07T20:03:54.0295055Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.0295957Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:54.0296420Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:54.0296878Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:54.0297316Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:54.0297743Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.0298223Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.0298610Z U c10::BoolType::get() 2025-05-07T20:03:54.0298959Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:54.0299390Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:54.0299771Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:54.0300483Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:54.0301676Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:54.0302719Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:54.0303287Z U c10::Error::what() const 2025-05-07T20:03:54.0303572Z U c10::FloatType::get() 2025-05-07T20:03:54.0303907Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.0304320Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.0304722Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:54.0305091Z U c10::IntType::get() 2025-05-07T20:03:54.0305427Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:54.0305804Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:54.0306132Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:54.0306473Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:54.0306835Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:54.0307205Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:54.0307581Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:54.0308231Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:54.0308875Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:54.0309233Z U c10::SymInt::operator+=(c10::SymInt const&) 2025-05-07T20:03:54.0309575Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:54.0309928Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:54.0310309Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:54.0310665Z U c10::SymInt::sym_ge(c10::SymInt const&) const 2025-05-07T20:03:54.0311015Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:54.0311349Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:54.0311692Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:03:54.0312012Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:54.0312308Z U c10::SymIntType::get() 2025-05-07T20:03:54.0312583Z U c10::TensorType::get() 2025-05-07T20:03:54.0313027Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:54.0314194Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:54.0315179Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:54.0315555Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:54.0315899Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:54.0316251Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:54.0316593Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:54.0316942Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:54.0317447Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:54.0317933Z U c10::cuda::device_count() 2025-05-07T20:03:54.0318297Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:54.0318707Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:54.0319092Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:54.0319600Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:54.0319984Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:54.0320350Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:54.0321074Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:54.0321910Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:54.0322725Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.0323642Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:54.0324624Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.0325388Z U c10::get_default_dtype() 2025-05-07T20:03:54.0325688Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:54.0326013Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:54.0326581Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:54.0327164Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:54.0327608Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:54.0327933Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:54.0328306Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:54.0328665Z U c10::operator%(c10::SymInt const&, int) 2025-05-07T20:03:54.0329036Z U c10::operator*(c10::SymInt const&, long) 2025-05-07T20:03:54.0329573Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:54.0329912Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:03:54.0330315Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:54.0330700Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:54.0344312Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:54.0344725Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:54.0345070Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:54.0345638Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:54.0346084Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.0346448Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:54.0346818Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:54.0347172Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:54.0347514Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:54.0347855Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:54.0348194Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.0348550Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:54.0348887Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:54.0349227Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:54.0349560Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:54.0349909Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.0350276Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:54.0350787Z U fbgemm_gpu::reshape_vbe_output(at::Tensor const&, long, at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:54.0351303Z U float at::Tensor::item() const 2025-05-07T20:03:54.0351657Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.0352054Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.0352413Z U free@GLIBC_2.2.5 2025-05-07T20:03:54.0352806Z U int at::Tensor::item() const 2025-05-07T20:03:54.0353149Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.0353702Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.0354140Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:54.0354709Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.0355116Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.0355483Z U memcpy@GLIBC_2.14 2025-05-07T20:03:54.0355767Z U memset@GLIBC_2.2.5 2025-05-07T20:03:54.0356082Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:54.0356425Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:54.0357016Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:54.0357925Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:54.0358835Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.0360029Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.0361047Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:54.0361893Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.0362804Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:54.0363826Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.0364542Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.0364912Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.0365301Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:54.0365801Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:54.0366674Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.0367434Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:54.0367777Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:54.0368107Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:54.0368426Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:54.0368536Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:54.0368719Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.0368944Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.0369065Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:54.0369171Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:54.0369261Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:54.0369374Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:54.0369938Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:54.0370380Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.0370651Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.0371002Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:54.0371100Z U typeinfo for c10::Error 2025-05-07T20:03:54.0371240Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:54.0371399Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:54.0371550Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.0371671Z U vtable for c10::Error 2025-05-07T20:03:54.0371994Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.0372233Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:54.0372398Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:54.0372511Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:54.0372607Z w _ITM_registerTMCloneTable 2025-05-07T20:03:54.0372730Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:54.0372826Z w __gmon_start__ 2025-05-07T20:03:54.0372916Z w __pthread_key_create 2025-05-07T20:03:54.0373053Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:54.0373288Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:54.0373298Z 2025-05-07T20:03:54.0373436Z linux-vdso.so.1 (0x00007ffc577fa000) 2025-05-07T20:03:54.0373519Z libc10.so => not found 2025-05-07T20:03:54.0373618Z libc10_cuda.so => not found 2025-05-07T20:03:54.0374074Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f5740c0a000) 2025-05-07T20:03:54.0374162Z libtorch.so => not found 2025-05-07T20:03:54.0374258Z libtorch_cpu.so => not found 2025-05-07T20:03:54.0374346Z libtorch_cuda.so => not found 2025-05-07T20:03:54.0374435Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.0374583Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f57409a6000) 2025-05-07T20:03:54.0374729Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f5740978000) 2025-05-07T20:03:54.0374842Z libc.so.6 => /lib64/libc.so.6 (0x00007f5740770000) 2025-05-07T20:03:54.0374957Z /lib64/ld-linux-x86-64.so.2 (0x00007f574d178000) 2025-05-07T20:03:54.0375046Z libc10.so => not found 2025-05-07T20:03:54.0375133Z libc10_cuda.so => not found 2025-05-07T20:03:54.0375470Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f5740200000) 2025-05-07T20:03:54.0375896Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f574d165000) 2025-05-07T20:03:54.0375982Z libtorch.so => not found 2025-05-07T20:03:54.0376072Z libtorch_cpu.so => not found 2025-05-07T20:03:54.0376163Z libtorch_cuda.so => not found 2025-05-07T20:03:54.0376261Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.0376403Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f57401aa000) 2025-05-07T20:03:54.0376513Z libm.so.6 => /lib64/libm.so.6 (0x00007f57400cf000) 2025-05-07T20:03:54.0376606Z libc10.so => not found 2025-05-07T20:03:54.0376938Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f5740057000) 2025-05-07T20:03:54.0377024Z libtorch.so => not found 2025-05-07T20:03:54.0377114Z libtorch_cpu.so => not found 2025-05-07T20:03:54.0377213Z libtorch_cuda.so => not found 2025-05-07T20:03:54.0377300Z libtorch.so => not found 2025-05-07T20:03:54.0377379Z libc10.so => not found 2025-05-07T20:03:54.0377479Z libtorch_cpu.so => not found 2025-05-07T20:03:54.0377565Z libtorch_cuda.so => not found 2025-05-07T20:03:54.0377751Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f574076b000) 2025-05-07T20:03:54.0377840Z libtorch_cpu.so => not found 2025-05-07T20:03:54.0377940Z libtorch_cuda.so => not found 2025-05-07T20:03:54.0378023Z libtorch.so => not found 2025-05-07T20:03:54.0378145Z librt.so.1 => /lib64/librt.so.1 (0x00007f5740764000) 2025-05-07T20:03:54.0378150Z 2025-05-07T20:03:54.0378259Z [CHECK] Displaying ELF information: 2025-05-07T20:03:54.0378503Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so 2025-05-07T20:03:54.0378507Z 2025-05-07T20:03:54.0389034Z 2025-05-07T20:03:54.0389600Z Dynamic section at offset 0xbaf1f50 contains 38 entries: 2025-05-07T20:03:54.0390035Z Tag Type Name/Value 2025-05-07T20:03:54.0390673Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:54.0391277Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:54.0392006Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:54.0392581Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:54.0393375Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:54.0393962Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:54.0394175Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:54.0394374Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:54.0394588Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:54.0394780Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:54.0394992Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:54.0395264Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_forward.so] 2025-05-07T20:03:54.0395448Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:54.0395560Z 0x000000000000000c (INIT) 0x448000 2025-05-07T20:03:54.0395686Z 0x000000000000000d (FINI) 0x1fced1c 2025-05-07T20:03:54.0395804Z 0x0000000000000019 (INIT_ARRAY) 0xbaea2f0 2025-05-07T20:03:54.0395931Z 0x000000000000001b (INIT_ARRAYSZ) 752 (bytes) 2025-05-07T20:03:54.0396046Z 0x000000000000001a (FINI_ARRAY) 0xbaea5e0 2025-05-07T20:03:54.0396167Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:54.0396276Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:54.0396386Z 0x0000000000000005 (STRTAB) 0x5dd10 2025-05-07T20:03:54.0396502Z 0x0000000000000006 (SYMTAB) 0x14360 2025-05-07T20:03:54.0396644Z 0x000000000000000a (STRSZ) 3688571 (bytes) 2025-05-07T20:03:54.0396758Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:54.0396876Z 0x0000000000000003 (PLTGOT) 0xbaf21f0 2025-05-07T20:03:54.0397013Z 0x0000000000000002 (PLTRELSZ) 14520 (bytes) 2025-05-07T20:03:54.0397119Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:54.0397235Z 0x0000000000000017 (JMPREL) 0x443ae8 2025-05-07T20:03:54.0397351Z 0x0000000000000007 (RELA) 0x3e88a0 2025-05-07T20:03:54.0397484Z 0x0000000000000008 (RELASZ) 373320 (bytes) 2025-05-07T20:03:54.0397602Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:54.0397710Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:54.0397830Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:54.0397946Z 0x000000006ffffffe (VERNEED) 0x3e87b0 2025-05-07T20:03:54.0398051Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:54.0398171Z 0x000000006ffffff0 (VERSYM) 0x3e258c 2025-05-07T20:03:54.0398279Z 0x000000006ffffff9 (RELACOUNT) 1838 2025-05-07T20:03:54.0398379Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:54.0398383Z 2025-05-07T20:03:54.0398536Z ################################################################################ 2025-05-07T20:03:54.0398541Z 2025-05-07T20:03:54.0398545Z 2025-05-07T20:03:54.0398660Z ################################################################################ 2025-05-07T20:03:54.0399016Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:54.0399127Z [CHECK] Listing out library size: 2025-05-07T20:03:54.0399471Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:54.0399502Z 2025-05-07T20:03:54.0401019Z 5 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:54.0402019Z 2025-05-07T20:03:54.0402931Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:54.0403568Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.0403574Z 2025-05-07T20:03:54.0667301Z GLIBC_2.2.5 2025-05-07T20:03:54.0668362Z GLIBC_2.3 2025-05-07T20:03:54.0668469Z GLIBC_2.14 2025-05-07T20:03:54.0668476Z 2025-05-07T20:03:54.0668482Z 2025-05-07T20:03:54.0669017Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:54.0669623Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.0669632Z 2025-05-07T20:03:54.0926227Z GLIBCXX_3.4 2025-05-07T20:03:54.0926506Z GLIBCXX_3.4.9 2025-05-07T20:03:54.0926793Z GLIBCXX_3.4.11 2025-05-07T20:03:54.0927045Z GLIBCXX_3.4.15 2025-05-07T20:03:54.0927283Z GLIBCXX_3.4.18 2025-05-07T20:03:54.0927504Z GLIBCXX_3.4.20 2025-05-07T20:03:54.0927733Z GLIBCXX_3.4.21 2025-05-07T20:03:54.0927751Z 2025-05-07T20:03:54.0927763Z 2025-05-07T20:03:54.0949020Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.DtLQ89lU01.symbols.txt 2025-05-07T20:03:54.0949065Z 2025-05-07T20:03:54.1159024Z 2025-05-07T20:03:54.1185422Z [CHECK] Total Number of symbols: 2987 2025-05-07T20:03:54.1217521Z [CHECK] Number of fbgemm symbols: 1 2025-05-07T20:03:54.1235917Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so > /tmp/tmp.TV4kUVnENB.usymbols.txt 2025-05-07T20:03:54.1235949Z 2025-05-07T20:03:54.1268476Z 2025-05-07T20:03:54.1299798Z [CHECK] Listing out undefined symbols (196 total): 2025-05-07T20:03:54.1318735Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.1319109Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.1319217Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:54.1319358Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:54.1319463Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:54.1319570Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:54.1319682Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:54.1319792Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:54.1319896Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:54.1320003Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:54.1320119Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:54.1320217Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:54.1320314Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:54.1320433Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:54.1320715Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:54.1320836Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:54.1321030Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:54.1321164Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:54.1321276Z U at::RecordFunction::end() 2025-05-07T20:03:54.1321413Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:54.1321604Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:54.1322378Z U at::_ops::_sparse_coo_tensor_unsafe::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.1322770Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:03:54.1324761Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.1325467Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.1325636Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:54.1325833Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:54.1326028Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:54.1326171Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:03:54.1326345Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:54.1326477Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:54.1326633Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:54.1326771Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:54.1326867Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:54.1326968Z U c10::AnyType::get() 2025-05-07T20:03:54.1327075Z U c10::BoolType::get() 2025-05-07T20:03:54.1327258Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:54.1327373Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:54.1327915Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:54.1328566Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:54.1328946Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:54.1329064Z U c10::Error::what() const 2025-05-07T20:03:54.1329163Z U c10::FloatType::get() 2025-05-07T20:03:54.1329270Z U c10::GradMode::is_enabled() 2025-05-07T20:03:54.1329386Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:54.1329545Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:54.1329667Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:54.1329788Z U c10::IValue::isBoolList() const 2025-05-07T20:03:54.1329898Z U c10::IValue::isIntList() const 2025-05-07T20:03:54.1330015Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:54.1330249Z U c10::IValue::isTensorList() const 2025-05-07T20:03:54.1330400Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:54.1330497Z U c10::IntType::get() 2025-05-07T20:03:54.1330990Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:54.1331169Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:54.1331487Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:54.1331614Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:54.1331784Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:54.1332008Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:54.1332339Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:54.1332459Z U c10::StringType::get() 2025-05-07T20:03:54.1332602Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:54.1332744Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:54.1332960Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:03:54.1333111Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:54.1333265Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:03:54.1333700Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:54.1333843Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:54.1333971Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:03:54.1334124Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:03:54.1334261Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:54.1334380Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:54.1334525Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:54.1334658Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:54.1334781Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:54.1334904Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:54.1335005Z U c10::SymIntType::get() 2025-05-07T20:03:54.1335129Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:54.1335231Z U c10::TensorType::get() 2025-05-07T20:03:54.1335367Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:54.1335821Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:54.1336359Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:54.1336641Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:54.1337158Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.1337511Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:54.1338124Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.1338715Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:54.1338927Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:54.1339048Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:54.1339205Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:54.1339594Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:54.1339720Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:54.1339884Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:54.1340081Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:03:54.1340222Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:54.1340449Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:54.1340579Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:54.1340841Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:54.1340935Z U free@GLIBC_2.2.5 2025-05-07T20:03:54.1341129Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:54.1341235Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:54.1341325Z U memcpy@GLIBC_2.14 2025-05-07T20:03:54.1341421Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:54.1341518Z U memset@GLIBC_2.2.5 2025-05-07T20:03:54.1341635Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:54.1341758Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:54.1341860Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:54.1342078Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:54.1342423Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:54.1342833Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:54.1343352Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1343885Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.1344383Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1344773Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1345273Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1345608Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1346121Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1346442Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:54.1346795Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:54.1347167Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:54.1347290Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:54.1347397Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:54.1347529Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.1347672Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.1347833Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:54.1347984Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:54.1348123Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:54.1348351Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:54.1348925Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.1349054Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:54.1349188Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:54.1349303Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:54.1349417Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:54.1349518Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:54.1349687Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.1349919Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.1350033Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:54.1350187Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1350323Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:54.1350723Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1350847Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:54.1350958Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:54.1351049Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:54.1351133Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:54.1351250Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:54.1351814Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:54.1352251Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.1352495Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.1352681Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:54.1352969Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:54.1353332Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:54.1353537Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:54.1353727Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:54.1354087Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:54.1354240Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:54.1354460Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:54.1354641Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:54.1354760Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:54.1354874Z U torch::autograd::Node::metadata() 2025-05-07T20:03:54.1355018Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:54.1355264Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:54.1355563Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:54.1355706Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:54.1355917Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:54.1356165Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:54.1358945Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:54.1359113Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:54.1359265Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:54.1359427Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:54.1360239Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:03:54.1360391Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:54.1360810Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:54.1361190Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:54.1361292Z U typeinfo for c10::Error 2025-05-07T20:03:54.1361436Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:54.1361571Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:54.1361698Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:54.1361829Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:54.1361954Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:54.1362104Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:54.1362265Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:54.1362430Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:54.1362583Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.1362680Z U vtable for c10::Error 2025-05-07T20:03:54.1363055Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.1363186Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:54.1363413Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:54.1363533Z U vtable for torch::autograd::Node 2025-05-07T20:03:54.1363709Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:54.1363818Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:54.1363922Z w _ITM_registerTMCloneTable 2025-05-07T20:03:54.1364061Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:54.1364148Z w __gmon_start__ 2025-05-07T20:03:54.1364241Z w __pthread_key_create 2025-05-07T20:03:54.1364357Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:54.1364493Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:54.1364657Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:54.1364955Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:54.1364963Z 2025-05-07T20:03:54.1369488Z linux-vdso.so.1 (0x00007fffeebe0000) 2025-05-07T20:03:54.1369687Z libc10.so => not found 2025-05-07T20:03:54.1370147Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007fbe599ad000) 2025-05-07T20:03:54.1370618Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fbe58a00000) 2025-05-07T20:03:54.1370917Z libtorch.so => not found 2025-05-07T20:03:54.1371023Z libtorch_cpu.so => not found 2025-05-07T20:03:54.1371119Z libtorch_cuda.so => not found 2025-05-07T20:03:54.1371280Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fbe5879c000) 2025-05-07T20:03:54.1371442Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fbe5997d000) 2025-05-07T20:03:54.1371567Z libc.so.6 => /lib64/libc.so.6 (0x00007fbe58594000) 2025-05-07T20:03:54.1371692Z /lib64/ld-linux-x86-64.so.2 (0x00007fbe599bc000) 2025-05-07T20:03:54.1371779Z libtorch.so => not found 2025-05-07T20:03:54.1371872Z libc10.so => not found 2025-05-07T20:03:54.1371963Z libtorch_cpu.so => not found 2025-05-07T20:03:54.1372053Z libtorch_cuda.so => not found 2025-05-07T20:03:54.1372206Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fbe59925000) 2025-05-07T20:03:54.1372377Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fbe59920000) 2025-05-07T20:03:54.1372464Z libtorch.so => not found 2025-05-07T20:03:54.1372548Z libc10.so => not found 2025-05-07T20:03:54.1372649Z libc10_cuda.so => not found 2025-05-07T20:03:54.1372737Z libtorch_cpu.so => not found 2025-05-07T20:03:54.1372826Z libtorch_cuda.so => not found 2025-05-07T20:03:54.1372932Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.1373055Z libm.so.6 => /lib64/libm.so.6 (0x00007fbe59325000) 2025-05-07T20:03:54.1373072Z 2025-05-07T20:03:54.1373177Z [CHECK] Displaying ELF information: 2025-05-07T20:03:54.1373506Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so 2025-05-07T20:03:54.1373511Z 2025-05-07T20:03:54.1404194Z 2025-05-07T20:03:54.1405318Z Dynamic section at offset 0x4b06b0 contains 37 entries: 2025-05-07T20:03:54.1405473Z Tag Type Name/Value 2025-05-07T20:03:54.1405721Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:54.1405959Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_config.so] 2025-05-07T20:03:54.1406204Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:54.1406413Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:54.1406641Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:54.1406865Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:54.1407965Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:54.1408226Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:54.1408438Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:54.1408671Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:54.1409007Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_split_host.so] 2025-05-07T20:03:54.1409196Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:54.1409373Z 0x000000000000000c (INIT) 0xd0000 2025-05-07T20:03:54.1409512Z 0x000000000000000d (FINI) 0x3f2b18 2025-05-07T20:03:54.1409631Z 0x0000000000000019 (INIT_ARRAY) 0x4a9ff8 2025-05-07T20:03:54.1409821Z 0x000000000000001b (INIT_ARRAYSZ) 304 (bytes) 2025-05-07T20:03:54.1409945Z 0x000000000000001a (FINI_ARRAY) 0x4aa128 2025-05-07T20:03:54.1410093Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:54.1410214Z 0x000000006ffffef5 (GNU_HASH) 0x200 2025-05-07T20:03:54.1410332Z 0x0000000000000005 (STRTAB) 0x15da8 2025-05-07T20:03:54.1410525Z 0x0000000000000006 (SYMTAB) 0x4588 2025-05-07T20:03:54.1410672Z 0x000000000000000a (STRSZ) 609567 (bytes) 2025-05-07T20:03:54.1410799Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:54.1410931Z 0x0000000000000003 (PLTGOT) 0x4b1940 2025-05-07T20:03:54.1411096Z 0x0000000000000002 (PLTRELSZ) 31704 (bytes) 2025-05-07T20:03:54.1411217Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:54.1411340Z 0x0000000000000017 (JMPREL) 0xc7630 2025-05-07T20:03:54.1411476Z 0x0000000000000007 (RELA) 0xac330 2025-05-07T20:03:54.1411618Z 0x0000000000000008 (RELASZ) 111360 (bytes) 2025-05-07T20:03:54.1411745Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:54.1411872Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:54.1412002Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:54.1412125Z 0x000000006ffffffe (VERNEED) 0xac220 2025-05-07T20:03:54.1412244Z 0x000000006fffffff (VERNEEDNUM) 4 2025-05-07T20:03:54.1412391Z 0x000000006ffffff0 (VERSYM) 0xaaac8 2025-05-07T20:03:54.1412507Z 0x000000006ffffff9 (RELACOUNT) 40 2025-05-07T20:03:54.1412616Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:54.1412645Z 2025-05-07T20:03:54.1412787Z ################################################################################ 2025-05-07T20:03:54.1412793Z 2025-05-07T20:03:54.1412798Z 2025-05-07T20:03:54.1412919Z ################################################################################ 2025-05-07T20:03:54.1413220Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:54.1413360Z [CHECK] Listing out library size: 2025-05-07T20:03:54.1413661Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:54.1413666Z 2025-05-07T20:03:54.1419725Z 18 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:54.1419757Z 2025-05-07T20:03:54.1420182Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:54.1420709Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.1420717Z 2025-05-07T20:03:54.1535343Z GLIBC_2.2.5 2025-05-07T20:03:54.1535612Z GLIBC_2.3 2025-05-07T20:03:54.1535867Z GLIBC_2.14 2025-05-07T20:03:54.1535981Z 2025-05-07T20:03:54.1536035Z 2025-05-07T20:03:54.1537404Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:54.1539359Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.1539401Z 2025-05-07T20:03:54.1658359Z GLIBCXX_3.4 2025-05-07T20:03:54.1659468Z GLIBCXX_3.4.9 2025-05-07T20:03:54.1659738Z GLIBCXX_3.4.11 2025-05-07T20:03:54.1659974Z GLIBCXX_3.4.15 2025-05-07T20:03:54.1660209Z GLIBCXX_3.4.18 2025-05-07T20:03:54.1660429Z GLIBCXX_3.4.20 2025-05-07T20:03:54.1660649Z GLIBCXX_3.4.21 2025-05-07T20:03:54.1660667Z 2025-05-07T20:03:54.1660680Z 2025-05-07T20:03:54.1676843Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.WqhvOgIEGm.symbols.txt 2025-05-07T20:03:54.1677189Z 2025-05-07T20:03:54.1759130Z 2025-05-07T20:03:54.1786534Z [CHECK] Total Number of symbols: 1515 2025-05-07T20:03:54.1798918Z [CHECK] Number of fbgemm symbols: 211 2025-05-07T20:03:54.1818651Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so > /tmp/tmp.c8GiuoUgUZ.usymbols.txt 2025-05-07T20:03:54.1818700Z 2025-05-07T20:03:54.1837825Z 2025-05-07T20:03:54.1862443Z [CHECK] Listing out undefined symbols (273 total): 2025-05-07T20:03:54.1876455Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.1877520Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.1877997Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:54.1878435Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.1878882Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.1879267Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.1879665Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:54.1880059Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:54.1880408Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:54.1880799Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.1881145Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:54.1881439Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:54.1881736Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:54.1882024Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:54.1882342Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:54.1882630Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:54.1882930Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:54.1883235Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:54.1883527Z U __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:03:54.1883810Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:54.1884111Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:54.1884394Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:54.1884814Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:54.1884904Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:54.1885018Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:54.1885155Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:54.1885319Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:54.1885453Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:54.1885573Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:54.1885716Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:54.1885923Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:54.1886041Z U at::TensorMaker::make_tensor() 2025-05-07T20:03:54.1886228Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:03:54.1886401Z U at::_ops::concat::call(c10::ArrayRef, long) 2025-05-07T20:03:54.1886557Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:54.1887120Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.1887763Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.1887980Z U at::_ops::eq_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:54.1888144Z U at::_ops::eq_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:54.1888389Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:54.1888560Z U at::_ops::gt_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:54.1888885Z U at::_ops::index_add::call(at::Tensor const&, long, at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:54.1889108Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:03:54.1889231Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:03:54.1889399Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:54.1889616Z U at::_ops::narrow::call(at::Tensor const&, long, c10::SymInt, c10::SymInt) 2025-05-07T20:03:54.1889791Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:54.1890030Z U at::_ops::split_with_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:03:54.1890357Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:54.1890969Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:54.1891171Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:54.1891327Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:54.1891785Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.1892342Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.1892472Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:54.1892593Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:54.1892754Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:54.1892854Z U at::globalContext() 2025-05-07T20:03:54.1892990Z U at::has_internal_overlap(at::TensorBase const&) 2025-05-07T20:03:54.1893121Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:54.1893212Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:54.1893319Z U bool at::Tensor::item() const 2025-05-07T20:03:54.1893410Z U c10::AnyType::get() 2025-05-07T20:03:54.1893583Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:03:54.1893784Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.1893910Z U c10::BoolType::get() 2025-05-07T20:03:54.1894085Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:54.1894265Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:54.1894376Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:54.1894884Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:54.1895486Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:54.1895888Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:54.1896021Z U c10::Error::what() const 2025-05-07T20:03:54.1896126Z U c10::GradMode::is_enabled() 2025-05-07T20:03:54.1896232Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:54.1896414Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.1896595Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:54.1896707Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:54.1896828Z U c10::IValue::isBoolList() const 2025-05-07T20:03:54.1896930Z U c10::IValue::isIntList() const 2025-05-07T20:03:54.1897040Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:54.1897166Z U c10::IValue::isTensorList() const 2025-05-07T20:03:54.1897303Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:54.1897401Z U c10::IntType::get() 2025-05-07T20:03:54.1897871Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:54.1898035Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:54.1898151Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:54.1898289Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:54.1898409Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:54.1898672Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:54.1898837Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:54.1898935Z U c10::StringType::get() 2025-05-07T20:03:54.1899072Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:54.1899479Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:54.1899612Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:54.1899726Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:54.1899841Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:54.1899939Z U c10::SymIntType::get() 2025-05-07T20:03:54.1900086Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:54.1900203Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:54.1900636Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:03:54.1900785Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:54.1900886Z U c10::TensorType::get() 2025-05-07T20:03:54.1901084Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:03:54.1901219Z U c10::Type::is_module() const 2025-05-07T20:03:54.1901343Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:54.1902031Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:54.1902158Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:54.1902277Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:54.1902406Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:54.1902594Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:54.1902715Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:54.1902844Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:54.1903107Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:54.1903209Z U c10::cuda::device_count() 2025-05-07T20:03:54.1903360Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:54.1903508Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:54.1903648Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:54.1903794Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:54.1903944Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:54.1904053Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:54.1904480Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:54.1904970Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:54.1905213Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:54.1905697Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.1906019Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:54.1906585Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.1906851Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:54.1907108Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:54.1907309Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:03:54.1907423Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:54.1907530Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:54.1907851Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:54.1908029Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:54.1908170Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:54.1908345Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:54.1908461Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:54.1908576Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:54.1908727Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:54.1909120Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:54.1909262Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:54.1909409Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:54.1909565Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:54.1909676Z U c10::throwNullDataPtrError() 2025-05-07T20:03:54.1909797Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:03:54.1909916Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:54.1910048Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:54.1910235Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:54.1910367Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:54.1910526Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:54.1910653Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.1910797Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:54.1910910Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:54.1911057Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:54.1911183Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:54.1911296Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:54.1911415Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.1911537Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:54.1911684Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:54.1911799Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:54.1911912Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:54.1912046Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:54.1912162Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:54.1912285Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.1912406Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:54.1912702Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:03:54.1912879Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.1912971Z U free@GLIBC_2.2.5 2025-05-07T20:03:54.1913302Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.1913402Z U log2f@GLIBC_2.2.5 2025-05-07T20:03:54.1913520Z U long at::Tensor::item() const 2025-05-07T20:03:54.1913706Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:54.1913843Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.1914074Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.1914182Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:54.1914280Z U memcpy@GLIBC_2.14 2025-05-07T20:03:54.1914376Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:54.1914467Z U memset@GLIBC_2.2.5 2025-05-07T20:03:54.1914589Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:54.1914711Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:54.1914805Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:54.1915029Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:54.1915375Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:54.1915770Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:54.1916221Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1916773Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.1917180Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1917593Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1918076Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:54.1918645Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1919232Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1919580Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:54.1919962Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:54.1920081Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:54.1920205Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:54.1920348Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.1920488Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.1920668Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:54.1920799Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:54.1920940Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:54.1921198Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:54.1921786Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.1921919Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:54.1922046Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:54.1922164Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:54.1922283Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:54.1922403Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:54.1922585Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.1922828Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.1922965Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:54.1923132Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1923264Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:54.1923447Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1923894Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:54.1924064Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:54.1924182Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:54.1924277Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:54.1924368Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:54.1924490Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:54.1925092Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:54.1925558Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.1925858Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.1926008Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:54.1926304Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:54.1926493Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:54.1926727Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:54.1926915Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:54.1927272Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:54.1927421Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:54.1927612Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:54.1927796Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:54.1927922Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:54.1928037Z U torch::autograd::Node::metadata() 2025-05-07T20:03:54.1928169Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:54.1928428Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:54.1928701Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:54.1928837Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:54.1929052Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:54.1929275Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:54.1932232Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:54.1932391Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:54.1932557Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:54.1932721Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:54.1932881Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:54.1933393Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:54.1933766Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:54.1934324Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:54.1934440Z U typeinfo for c10::Error 2025-05-07T20:03:54.1936030Z U typeinfo for c10::Type 2025-05-07T20:03:54.1936175Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:54.1936314Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:54.1936487Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:54.1936604Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:54.1936773Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:54.1936934Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:54.1937124Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:54.1937294Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.1937450Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.1937548Z U vtable for c10::Error 2025-05-07T20:03:54.1937657Z U vtable for c10::ListType 2025-05-07T20:03:54.1937995Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.1938129Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:54.1938372Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:54.1938502Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:03:54.1938614Z U vtable for torch::autograd::Node 2025-05-07T20:03:54.1938802Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:54.1938912Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:54.1939018Z w _ITM_registerTMCloneTable 2025-05-07T20:03:54.1939123Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:54.1939218Z w __gmon_start__ 2025-05-07T20:03:54.1939312Z w __pthread_key_create 2025-05-07T20:03:54.1939424Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:54.1939544Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:54.1939687Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:54.1939909Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:54.1939917Z 2025-05-07T20:03:54.1940074Z linux-vdso.so.1 (0x00007fff46518000) 2025-05-07T20:03:54.1940162Z libc10.so => not found 2025-05-07T20:03:54.1940254Z libc10_cuda.so => not found 2025-05-07T20:03:54.1940843Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fb707d08000) 2025-05-07T20:03:54.1941316Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fb707400000) 2025-05-07T20:03:54.1941409Z libtorch.so => not found 2025-05-07T20:03:54.1941504Z libtorch_cpu.so => not found 2025-05-07T20:03:54.1941611Z libtorch_cuda.so => not found 2025-05-07T20:03:54.1941709Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.1941873Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fb70719c000) 2025-05-07T20:03:54.1942011Z libm.so.6 => /lib64/libm.so.6 (0x00007fb7070c1000) 2025-05-07T20:03:54.1942159Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fb7091c8000) 2025-05-07T20:03:54.1942306Z libc.so.6 => /lib64/libc.so.6 (0x00007fb706eb9000) 2025-05-07T20:03:54.1942443Z /lib64/ld-linux-x86-64.so.2 (0x00007fb7091fc000) 2025-05-07T20:03:54.1942526Z libc10.so => not found 2025-05-07T20:03:54.1942622Z libc10_cuda.so => not found 2025-05-07T20:03:54.1942711Z libtorch.so => not found 2025-05-07T20:03:54.1942814Z libtorch_cpu.so => not found 2025-05-07T20:03:54.1942905Z libtorch_cuda.so => not found 2025-05-07T20:03:54.1943000Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.1943279Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fb70916e000) 2025-05-07T20:03:54.1943399Z libtorch.so => not found 2025-05-07T20:03:54.1943482Z libc10.so => not found 2025-05-07T20:03:54.1943572Z libc10_cuda.so => not found 2025-05-07T20:03:54.1943672Z libtorch_cpu.so => not found 2025-05-07T20:03:54.1943764Z libtorch_cuda.so => not found 2025-05-07T20:03:54.1943991Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.1943996Z 2025-05-07T20:03:54.1944106Z [CHECK] Displaying ELF information: 2025-05-07T20:03:54.1944343Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so 2025-05-07T20:03:54.1944348Z 2025-05-07T20:03:54.1954374Z 2025-05-07T20:03:54.1954619Z Dynamic section at offset 0x11af470 contains 40 entries: 2025-05-07T20:03:54.1954748Z Tag Type Name/Value 2025-05-07T20:03:54.1954944Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:54.1955147Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:54.1955408Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:54.1955627Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:54.1955824Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:54.1956036Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:54.1956242Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:54.1956451Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:54.1956682Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:54.1956868Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:54.1957062Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:54.1957262Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:54.1957473Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:54.1957709Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_index_select.so] 2025-05-07T20:03:54.1957889Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:54.1958007Z 0x000000000000000c (INIT) 0x53000 2025-05-07T20:03:54.1958121Z 0x000000000000000d (FINI) 0x14c8cc 2025-05-07T20:03:54.1958238Z 0x0000000000000019 (INIT_ARRAY) 0x11ae010 2025-05-07T20:03:54.1958373Z 0x000000000000001b (INIT_ARRAYSZ) 144 (bytes) 2025-05-07T20:03:54.1958491Z 0x000000000000001a (FINI_ARRAY) 0x11ae0a0 2025-05-07T20:03:54.1958606Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:54.1958732Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:54.1958838Z 0x0000000000000005 (STRTAB) 0xb768 2025-05-07T20:03:54.1958943Z 0x0000000000000006 (SYMTAB) 0x2948 2025-05-07T20:03:54.1959077Z 0x000000000000000a (STRSZ) 240496 (bytes) 2025-05-07T20:03:54.1959204Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:54.1959317Z 0x0000000000000003 (PLTGOT) 0x11af730 2025-05-07T20:03:54.1959448Z 0x0000000000000002 (PLTRELSZ) 16896 (bytes) 2025-05-07T20:03:54.1959571Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:54.1959702Z 0x0000000000000017 (JMPREL) 0x4e360 2025-05-07T20:03:54.1959807Z 0x0000000000000007 (RELA) 0x47010 2025-05-07T20:03:54.1959936Z 0x0000000000000008 (RELASZ) 29520 (bytes) 2025-05-07T20:03:54.1960063Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:54.1960163Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:54.1960285Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:54.1960412Z 0x000000006ffffffe (VERNEED) 0x46eb0 2025-05-07T20:03:54.1960514Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:03:54.1960652Z 0x000000006ffffff0 (VERSYM) 0x462d8 2025-05-07T20:03:54.1960762Z 0x000000006ffffff9 (RELACOUNT) 213 2025-05-07T20:03:54.1960856Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:54.1960860Z 2025-05-07T20:03:54.1961002Z ################################################################################ 2025-05-07T20:03:54.1961007Z 2025-05-07T20:03:54.1961011Z 2025-05-07T20:03:54.1961128Z ################################################################################ 2025-05-07T20:03:54.1961442Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:54.1961566Z [CHECK] Listing out library size: 2025-05-07T20:03:54.1961876Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:54.1961880Z 2025-05-07T20:03:54.1967919Z 1 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:54.1968511Z 2025-05-07T20:03:54.1970190Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:54.1971849Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.1971858Z 2025-05-07T20:03:54.2028111Z GLIBC_2.2.5 2025-05-07T20:03:54.2029157Z GLIBC_2.3 2025-05-07T20:03:54.2029284Z GLIBC_2.14 2025-05-07T20:03:54.2029292Z 2025-05-07T20:03:54.2029298Z 2025-05-07T20:03:54.2029812Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:54.2030419Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.2030425Z 2025-05-07T20:03:54.2085858Z GLIBCXX_3.4 2025-05-07T20:03:54.2086136Z GLIBCXX_3.4.9 2025-05-07T20:03:54.2086418Z GLIBCXX_3.4.18 2025-05-07T20:03:54.2086648Z GLIBCXX_3.4.20 2025-05-07T20:03:54.2086902Z GLIBCXX_3.4.21 2025-05-07T20:03:54.2086921Z 2025-05-07T20:03:54.2086934Z 2025-05-07T20:03:54.2108127Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.GTr5doSh3S.symbols.txt 2025-05-07T20:03:54.2108203Z 2025-05-07T20:03:54.2134362Z 2025-05-07T20:03:54.2156512Z [CHECK] Total Number of symbols: 349 2025-05-07T20:03:54.2172284Z [CHECK] Number of fbgemm symbols: 57 2025-05-07T20:03:54.2187860Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so > /tmp/tmp.3Cj6j8LOAW.usymbols.txt 2025-05-07T20:03:54.2188394Z 2025-05-07T20:03:54.2203339Z 2025-05-07T20:03:54.2231865Z [CHECK] Listing out undefined symbols (123 total): 2025-05-07T20:03:54.2244100Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.2246530Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.2248165Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:54.2249174Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.2250368Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.2252005Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.2253114Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:54.2253493Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:54.2253876Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:54.2254262Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.2254601Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:54.2254918Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:54.2255224Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:54.2255599Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:54.2255915Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:54.2256246Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:54.2256604Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:54.2256919Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:54.2257254Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:54.2257561Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:54.2258419Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.2259784Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.2260737Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:54.2261163Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:54.2261517Z U c10::IntType::get() 2025-05-07T20:03:54.2261894Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:54.2262294Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:54.2262754Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:54.2263519Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:54.2264293Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:54.2264635Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:54.2264931Z U c10::TensorType::get() 2025-05-07T20:03:54.2265238Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:54.2266338Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:54.2267456Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:54.2267893Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:54.2268248Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:54.2268585Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:54.2268929Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:54.2269264Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:54.2269742Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:54.2270210Z U c10::cuda::device_count() 2025-05-07T20:03:54.2270560Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:54.2270951Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:54.2271335Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:54.2271764Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:54.2272170Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:54.2272558Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:54.2273455Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:54.2274346Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:54.2275231Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.2276236Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:54.2277467Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.2278300Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:54.2278657Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:54.2279012Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:54.2279398Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:54.2279792Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:54.2280157Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:54.2280568Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:54.2281026Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.2281408Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:54.2281790Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:54.2282159Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:54.2282510Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:54.2282866Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:54.2283222Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.2283608Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:54.2283996Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:54.2284357Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:54.2284706Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:54.2285048Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:54.2285415Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.2285778Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:54.2286171Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.2286630Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:54.2287080Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.2287460Z U memcpy@GLIBC_2.14 2025-05-07T20:03:54.2287752Z U memset@GLIBC_2.2.5 2025-05-07T20:03:54.2288079Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:54.2288431Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:54.2289024Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:54.2289903Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:54.2290828Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.2291941Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.2293010Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:54.2293942Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.2294942Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:54.2296078Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.2297088Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:54.2298004Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:54.2298639Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:54.2298981Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:54.2299592Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.2299972Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.2300371Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:54.2300887Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:54.2301773Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.2302551Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:54.2302903Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:54.2303230Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:54.2303561Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:54.2303942Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.2304454Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.2304914Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:54.2305235Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:54.2305539Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:54.2305827Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:54.2306606Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:54.2307860Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.2308634Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.2309335Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:54.2309989Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.2310440Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:54.2310874Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:54.2311277Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.2311848Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.2312465Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:54.2312971Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:54.2313467Z w _ITM_registerTMCloneTable 2025-05-07T20:03:54.2313769Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:54.2314091Z w __gmon_start__ 2025-05-07T20:03:54.2314358Z w __pthread_key_create 2025-05-07T20:03:54.2314697Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:54.2315186Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:54.2315579Z 2025-05-07T20:03:54.2315711Z linux-vdso.so.1 (0x00007ffe657fe000) 2025-05-07T20:03:54.2316012Z libtorch.so => not found 2025-05-07T20:03:54.2316251Z libc10.so => not found 2025-05-07T20:03:54.2316505Z libc10_cuda.so => not found 2025-05-07T20:03:54.2316762Z libtorch_cpu.so => not found 2025-05-07T20:03:54.2317069Z libtorch_cuda.so => not found 2025-05-07T20:03:54.2317327Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.2317662Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f94a6365000) 2025-05-07T20:03:54.2318086Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f94a630f000) 2025-05-07T20:03:54.2318487Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f94a62e1000) 2025-05-07T20:03:54.2318875Z libc.so.6 => /lib64/libc.so.6 (0x00007f94a60d9000) 2025-05-07T20:03:54.2319229Z /lib64/ld-linux-x86-64.so.2 (0x00007f94a6622000) 2025-05-07T20:03:54.2319579Z libm.so.6 => /lib64/libm.so.6 (0x00007f94a5ffe000) 2025-05-07T20:03:54.2319808Z 2025-05-07T20:03:54.2319913Z [CHECK] Displaying ELF information: 2025-05-07T20:03:54.2320385Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so 2025-05-07T20:03:54.2320764Z 2025-05-07T20:03:54.2320795Z 2025-05-07T20:03:54.2320946Z Dynamic section at offset 0x50440 contains 37 entries: 2025-05-07T20:03:54.2321308Z Tag Type Name/Value 2025-05-07T20:03:54.2321732Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:54.2322243Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:54.2322737Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:54.2323272Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:54.2323784Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:54.2324316Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:54.2324837Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:54.2325349Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:54.2325858Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:54.2326352Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:54.2326869Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:54.2327455Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_embedding_inplace_ops.so] 2025-05-07T20:03:54.2327940Z 0x000000000000000c (INIT) 0x10000 2025-05-07T20:03:54.2328265Z 0x000000000000000d (FINI) 0x2fa7c 2025-05-07T20:03:54.2328595Z 0x0000000000000019 (INIT_ARRAY) 0x50bf8 2025-05-07T20:03:54.2328938Z 0x000000000000001b (INIT_ARRAYSZ) 40 (bytes) 2025-05-07T20:03:54.2329270Z 0x000000000000001a (FINI_ARRAY) 0x50c20 2025-05-07T20:03:54.2329610Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:54.2329972Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:54.2330300Z 0x0000000000000005 (STRTAB) 0x2e30 2025-05-07T20:03:54.2330613Z 0x0000000000000006 (SYMTAB) 0xd60 2025-05-07T20:03:54.2330960Z 0x000000000000000a (STRSZ) 35916 (bytes) 2025-05-07T20:03:54.2331491Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:54.2331836Z 0x0000000000000003 (PLTGOT) 0x516e0 2025-05-07T20:03:54.2332194Z 0x0000000000000002 (PLTRELSZ) 5544 (bytes) 2025-05-07T20:03:54.2332527Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:54.2332916Z 0x0000000000000017 (JMPREL) 0xdc00 2025-05-07T20:03:54.2333232Z 0x0000000000000007 (RELA) 0xbe48 2025-05-07T20:03:54.2333576Z 0x0000000000000008 (RELASZ) 7608 (bytes) 2025-05-07T20:03:54.2333923Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:54.2334284Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:54.2334610Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:54.2334958Z 0x000000006ffffffe (VERNEED) 0xbd38 2025-05-07T20:03:54.2335287Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:54.2335638Z 0x000000006ffffff0 (VERSYM) 0xba7c 2025-05-07T20:03:54.2335963Z 0x000000006ffffff9 (RELACOUNT) 152 2025-05-07T20:03:54.2336256Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:54.2336468Z 2025-05-07T20:03:54.2336579Z ################################################################################ 2025-05-07T20:03:54.2336807Z 2025-05-07T20:03:54.2336814Z 2025-05-07T20:03:54.2336937Z ################################################################################ 2025-05-07T20:03:54.2337468Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:54.2337999Z [CHECK] Listing out library size: 2025-05-07T20:03:54.2338480Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:54.2338898Z 2025-05-07T20:03:54.2339147Z 492 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:54.2339494Z 2025-05-07T20:03:54.2339931Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:54.2341000Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.2341646Z 2025-05-07T20:03:54.4281113Z GLIBC_2.2.5 2025-05-07T20:03:54.4281771Z GLIBC_2.3 2025-05-07T20:03:54.4282328Z GLIBC_2.14 2025-05-07T20:03:54.4282650Z 2025-05-07T20:03:54.4282663Z 2025-05-07T20:03:54.4283552Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:54.4284672Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.4285344Z 2025-05-07T20:03:54.6230849Z GLIBCXX_3.4 2025-05-07T20:03:54.6231900Z GLIBCXX_3.4.9 2025-05-07T20:03:54.6232540Z GLIBCXX_3.4.11 2025-05-07T20:03:54.6233357Z GLIBCXX_3.4.14 2025-05-07T20:03:54.6233927Z GLIBCXX_3.4.15 2025-05-07T20:03:54.6234303Z GLIBCXX_3.4.18 2025-05-07T20:03:54.6234503Z GLIBCXX_3.4.20 2025-05-07T20:03:54.6234706Z GLIBCXX_3.4.21 2025-05-07T20:03:54.6234824Z 2025-05-07T20:03:54.6234829Z 2025-05-07T20:03:54.6252894Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.qEmQZV6hAE.symbols.txt 2025-05-07T20:03:54.6254510Z 2025-05-07T20:03:54.8175919Z 2025-05-07T20:03:54.8257137Z [CHECK] Total Number of symbols: 12554 2025-05-07T20:03:54.8349075Z [CHECK] Number of fbgemm symbols: 2318 2025-05-07T20:03:54.8362407Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so > /tmp/tmp.YQes1ZZhL1.usymbols.txt 2025-05-07T20:03:54.8363151Z 2025-05-07T20:03:54.8425968Z 2025-05-07T20:03:54.8453413Z [CHECK] Listing out undefined symbols (280 total): 2025-05-07T20:03:54.8469299Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.8470166Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.8470831Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:54.8471213Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.8471812Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.8472218Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.8472603Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:54.8473220Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:54.8473596Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:54.8473959Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.8474333Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:54.8474701Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:54.8475025Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:54.8475332Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:54.8475654Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:54.8475973Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:54.8476304Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:54.8476631Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:54.8476935Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:54.8477248Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:54.8477568Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:54.8477888Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:54.8478202Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:54.8478552Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:54.8478961Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:54.8479379Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:54.8479806Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:54.8480209Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:54.8480577Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:54.8480948Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:54.8481399Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:54.8482051Z U at::_ops::clamp::call(at::Tensor const&, std::optional const&, std::optional const&) 2025-05-07T20:03:54.8482652Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:54.8483533Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8484898Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8485942Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:54.8486929Z U at::_ops::sparse_coo_tensor_indices_size::call(at::Tensor const&, at::Tensor const&, c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8488074Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:54.8488602Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:03:54.8489001Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:54.8489724Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8490822Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.8491656Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:54.8492167Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:54.8492514Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:54.8492907Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:54.8493255Z U at::get_thread_num() 2025-05-07T20:03:54.8493568Z U at::globalContext() 2025-05-07T20:03:54.8493857Z U at::internal::set_thread_num(int) 2025-05-07T20:03:54.8494184Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:54.8494578Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:03:54.8494970Z U at::toAccumulateType(c10::ScalarType, bool) 2025-05-07T20:03:54.8495297Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:54.8495566Z U c10::AnyType::get() 2025-05-07T20:03:54.8495945Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.8496339Z U c10::BoolType::get() 2025-05-07T20:03:54.8496694Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:54.8497130Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:54.8497521Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:54.8498239Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:54.8499449Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:54.8500501Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:54.8501078Z U c10::Error::what() const 2025-05-07T20:03:54.8501362Z U c10::FloatType::get() 2025-05-07T20:03:54.8501686Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:54.8502013Z U c10::GradMode::is_enabled() 2025-05-07T20:03:54.8502313Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:54.8502667Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8503077Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.8503513Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:54.8503912Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:54.8504229Z U c10::IValue::isBoolList() const 2025-05-07T20:03:54.8504560Z U c10::IValue::isIntList() const 2025-05-07T20:03:54.8504871Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:54.8505212Z U c10::IValue::isTensorList() const 2025-05-07T20:03:54.8505556Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:54.8505941Z U c10::IntType::get() 2025-05-07T20:03:54.8506302Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:54.8506690Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:54.8507046Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:54.8507396Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:54.8507848Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:54.8508283Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:54.8508684Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:54.8509198Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:54.8509744Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:54.8510102Z U c10::StringType::get() 2025-05-07T20:03:54.8510425Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:54.8510814Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:54.8511530Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:54.8512140Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:54.8512491Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:54.8512913Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:54.8513424Z U c10::SymIntType::get() 2025-05-07T20:03:54.8513806Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:54.8514202Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:54.8514623Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:54.8515001Z U c10::TensorType::get() 2025-05-07T20:03:54.8515349Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:54.8516321Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:54.8517325Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:54.8517711Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:54.8518061Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:54.8518421Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:54.8518764Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:54.8519121Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:54.8519620Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:54.8520095Z U c10::cuda::device_count() 2025-05-07T20:03:54.8520464Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:54.8520858Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:54.8521267Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:54.8521677Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:54.8522089Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:54.8522497Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:54.8523170Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:54.8524298Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:54.8525222Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:54.8526101Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.8527078Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:54.8528146Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.8528995Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:54.8529341Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:54.8529940Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:54.8530588Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:54.8531055Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:54.8531784Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:54.8532205Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:54.8532552Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:54.8532956Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:54.8533623Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:54.8534240Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:54.8534625Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:54.8535024Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:54.8535453Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:54.8535884Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:54.8536261Z U c10::operator<=(c10::SymInt const&, int) 2025-05-07T20:03:54.8536629Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:03:54.8536982Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:03:54.8537340Z U c10::throwNullDataPtrError() 2025-05-07T20:03:54.8537680Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:54.8538008Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:54.8538437Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:54.8538869Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:54.8539242Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:54.8539624Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.8540022Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:54.8540413Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:54.8540772Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:54.8541142Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:54.8541485Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:54.8541848Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.8542214Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:54.8542604Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:54.8542980Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:54.8543445Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:54.8543784Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:54.8544160Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:54.8544507Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:54.8544849Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:54.8545808Z U fbgemm::EmbeddingSpMDMKernelSignature::Type fbgemm::GenerateEmbeddingSpMDMWithStrides(long, bool, bool, int, bool, bool, long, long, bool, bool, bool, bool) 2025-05-07T20:03:54.8546976Z U fbgemm::SparseAdaGradSignature::Type fbgemm::GenerateSparseAdaGrad(int, bool, int, bool) 2025-05-07T20:03:54.8547548Z U fbgemm::fbgemmAlignedFree(void*) 2025-05-07T20:03:54.8547950Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:54.8548358Z U float at::Tensor::item() const 2025-05-07T20:03:54.8548761Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8549168Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.8549520Z U free@GLIBC_2.2.5 2025-05-07T20:03:54.8551070Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8551442Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.8551868Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:54.8552264Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.8552739Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.8553276Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:54.8553564Z U memcpy@GLIBC_2.14 2025-05-07T20:03:54.8571615Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:54.8571973Z U memset@GLIBC_2.2.5 2025-05-07T20:03:54.8572272Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:54.8572591Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:54.8573117Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:54.8573810Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:54.8574505Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:54.8575224Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:54.8575931Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:54.8576635Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:54.8577134Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:54.8577747Z U split_embedding_codegen_forward_cpu(at::Tensor, at::Tensor, at::Tensor, c10::SymInt, at::Tensor, at::Tensor, at::Tensor, long, at::Tensor, long) 2025-05-07T20:03:54.8578666Z U split_embedding_codegen_grad_indice_weights_cpu(at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor, at::Tensor) 2025-05-07T20:03:54.8579246Z U sqrt@GLIBC_2.2.5 2025-05-07T20:03:54.8579513Z U sqrtf@GLIBC_2.2.5 2025-05-07T20:03:54.8579889Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:54.8580515Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:54.8581314Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:54.8582249Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8583245Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.8584227Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8585082Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8586241Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:54.8587874Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8589278Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8590286Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:54.8591113Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:54.8591729Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:54.8592064Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:54.8592431Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:54.8592953Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.8593354Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:54.8593769Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:54.8594202Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:54.8594598Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:54.8595088Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:54.8596041Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.8596872Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:54.8597230Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:54.8597588Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:54.8597930Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:54.8598270Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:54.8598676Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.8599220Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:54.8599701Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:54.8600107Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8600526Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:54.8601206Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:54.8601900Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:54.8602299Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:54.8602606Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:54.8602903Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:54.8603214Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:54.8604049Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:54.8605299Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.8606183Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:54.8606720Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:54.8607251Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:54.8607856Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:54.8608387Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:54.8608902Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:54.8609568Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:54.8610184Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:54.8610641Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:54.8611116Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:54.8611539Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:54.8611893Z U torch::autograd::Node::metadata() 2025-05-07T20:03:54.8612252Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:54.8612752Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:54.8613390Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:54.8613923Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:54.8614394Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:54.8614940Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:54.8618084Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:54.8620875Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:54.8621269Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:54.8621670Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:54.8622086Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:54.8622759Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:54.8623592Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:54.8624570Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:54.8625305Z U typeinfo for c10::Error 2025-05-07T20:03:54.8625631Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:54.8626021Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:54.8626357Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:54.8626735Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:54.8627079Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:54.8628324Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:03:54.8630453Z U void internal::csr2csc(internal::HyperCompressedSparseColumn&, int, at::TensorAccessor const&, at::TensorAccessor const&, at::TensorAccessor const&, long, int const*, long) 2025-05-07T20:03:54.8632328Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:54.8632847Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:54.8633299Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:54.8633749Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:54.8634119Z U vtable for c10::Error 2025-05-07T20:03:54.8634680Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.8635259Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:54.8635745Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:54.8636219Z U vtable for torch::autograd::Node 2025-05-07T20:03:54.8636625Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:54.8637043Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:54.8637369Z w _ITM_registerTMCloneTable 2025-05-07T20:03:54.8637695Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:54.8637990Z w __gmon_start__ 2025-05-07T20:03:54.8638280Z w __pthread_key_create 2025-05-07T20:03:54.8638584Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:54.8638927Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:54.8639301Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:54.8639797Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:54.8640160Z 2025-05-07T20:03:54.8640284Z linux-vdso.so.1 (0x00007fffcb5b5000) 2025-05-07T20:03:54.8640570Z libc10.so => not found 2025-05-07T20:03:54.8640820Z libc10_cuda.so => not found 2025-05-07T20:03:54.8641479Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f87d9e0a000) 2025-05-07T20:03:54.8642635Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f87d9d12000) 2025-05-07T20:03:54.8643487Z libtorch.so => not found 2025-05-07T20:03:54.8644008Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f87d9600000) 2025-05-07T20:03:54.8645076Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f87d8c00000) 2025-05-07T20:03:54.8645824Z libtorch_cpu.so => not found 2025-05-07T20:03:54.8646088Z libtorch_cuda.so => not found 2025-05-07T20:03:54.8646348Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.8646658Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f87d899c000) 2025-05-07T20:03:54.8647084Z libm.so.6 => /lib64/libm.so.6 (0x00007f87d9c37000) 2025-05-07T20:03:54.8647428Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f87fa183000) 2025-05-07T20:03:54.8647781Z libc.so.6 => /lib64/libc.so.6 (0x00007f87d8794000) 2025-05-07T20:03:54.8648169Z /lib64/ld-linux-x86-64.so.2 (0x00007f87fa1b9000) 2025-05-07T20:03:54.8648474Z libc10.so => not found 2025-05-07T20:03:54.8648689Z libc10_cuda.so => not found 2025-05-07T20:03:54.8649273Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f87fa178000) 2025-05-07T20:03:54.8649890Z libtorch.so => not found 2025-05-07T20:03:54.8650147Z libtorch_cpu.so => not found 2025-05-07T20:03:54.8650401Z libtorch_cuda.so => not found 2025-05-07T20:03:54.8650642Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.8650947Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f87d9be1000) 2025-05-07T20:03:54.8651257Z libc10.so => not found 2025-05-07T20:03:54.8651484Z libc10_cuda.so => not found 2025-05-07T20:03:54.8651720Z libtorch.so => not found 2025-05-07T20:03:54.8651956Z libtorch_cpu.so => not found 2025-05-07T20:03:54.8652194Z libtorch_cuda.so => not found 2025-05-07T20:03:54.8652439Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.8652682Z libc10.so => not found 2025-05-07T20:03:54.8653146Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f87d9b69000) 2025-05-07T20:03:54.8653665Z libtorch.so => not found 2025-05-07T20:03:54.8653892Z libtorch_cpu.so => not found 2025-05-07T20:03:54.8654138Z libtorch_cuda.so => not found 2025-05-07T20:03:54.8654378Z libtorch.so => not found 2025-05-07T20:03:54.8654609Z libc10.so => not found 2025-05-07T20:03:54.8654830Z libc10_cuda.so => not found 2025-05-07T20:03:54.8655074Z libtorch_cpu.so => not found 2025-05-07T20:03:54.8655323Z libtorch_cuda.so => not found 2025-05-07T20:03:54.8655560Z libcudart.so.11.0 => not found 2025-05-07T20:03:54.8655809Z libtorch.so => not found 2025-05-07T20:03:54.8656025Z libc10.so => not found 2025-05-07T20:03:54.8656248Z libtorch_cpu.so => not found 2025-05-07T20:03:54.8656483Z libtorch_cuda.so => not found 2025-05-07T20:03:54.8656802Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f87fa169000) 2025-05-07T20:03:54.8657149Z libtorch_cpu.so => not found 2025-05-07T20:03:54.8657396Z libtorch_cuda.so => not found 2025-05-07T20:03:54.8657629Z libtorch.so => not found 2025-05-07T20:03:54.8657894Z librt.so.1 => /lib64/librt.so.1 (0x00007f87fa162000) 2025-05-07T20:03:54.8658115Z 2025-05-07T20:03:54.8658227Z [CHECK] Displaying ELF information: 2025-05-07T20:03:54.8658653Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so 2025-05-07T20:03:54.8659006Z 2025-05-07T20:03:54.8659018Z 2025-05-07T20:03:54.8659171Z Dynamic section at offset 0x1eb9cd68 contains 42 entries: 2025-05-07T20:03:54.8659522Z Tag Type Name/Value 2025-05-07T20:03:54.8659914Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:54.8660386Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:54.8660882Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_common.so] 2025-05-07T20:03:54.8661435Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:54.8661983Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:54.8662437Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:54.8662918Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:54.8663407Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:54.8663890Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:54.8664376Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:54.8664853Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:54.8665342Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:54.8665789Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:54.8666275Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:54.8666746Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:54.8667291Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:54.8667839Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:54.8668208Z 0x000000000000000c (INIT) 0x5b0000 2025-05-07T20:03:54.8668527Z 0x000000000000000d (FINI) 0x2ee447c 2025-05-07T20:03:54.8668842Z 0x0000000000000019 (INIT_ARRAY) 0x1eb90820 2025-05-07T20:03:54.8669178Z 0x000000000000001b (INIT_ARRAYSZ) 1824 (bytes) 2025-05-07T20:03:54.8669504Z 0x000000000000001a (FINI_ARRAY) 0x1eb90f40 2025-05-07T20:03:54.8669830Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:54.8670151Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:54.8670447Z 0x0000000000000005 (STRTAB) 0x5ab08 2025-05-07T20:03:54.8670750Z 0x0000000000000006 (SYMTAB) 0x11200 2025-05-07T20:03:54.8671074Z 0x000000000000000a (STRSZ) 5105620 (bytes) 2025-05-07T20:03:54.8671416Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:54.8671735Z 0x0000000000000003 (PLTGOT) 0x1eb9e048 2025-05-07T20:03:54.8672082Z 0x0000000000000002 (PLTRELSZ) 63264 (bytes) 2025-05-07T20:03:54.8672411Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:54.8672823Z 0x0000000000000017 (JMPREL) 0x59f9b0 2025-05-07T20:03:54.8673330Z 0x0000000000000007 (RELA) 0x53f668 2025-05-07T20:03:54.8673677Z 0x0000000000000008 (RELASZ) 394056 (bytes) 2025-05-07T20:03:54.8674052Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:54.8674367Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:54.8674696Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:54.8675043Z 0x000000006ffffffe (VERNEED) 0x53f4f8 2025-05-07T20:03:54.8675375Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:03:54.8675698Z 0x000000006ffffff0 (VERSYM) 0x5392dc 2025-05-07T20:03:54.8676028Z 0x000000006ffffff9 (RELACOUNT) 2708 2025-05-07T20:03:54.8676344Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:54.8676543Z 2025-05-07T20:03:54.8676662Z ################################################################################ 2025-05-07T20:03:54.8676894Z 2025-05-07T20:03:54.8676899Z 2025-05-07T20:03:54.8677007Z ################################################################################ 2025-05-07T20:03:54.8677460Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:54.8677907Z [CHECK] Listing out library size: 2025-05-07T20:03:54.8678314Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:54.8678650Z 2025-05-07T20:03:54.8678812Z 40 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:54.8679081Z 2025-05-07T20:03:54.8679474Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:54.8680381Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.8680941Z 2025-05-07T20:03:54.8994903Z GLIBC_2.2.5 2025-05-07T20:03:54.8995542Z GLIBC_2.3 2025-05-07T20:03:54.8996067Z GLIBC_2.14 2025-05-07T20:03:54.8996389Z 2025-05-07T20:03:54.8996418Z 2025-05-07T20:03:54.8997524Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:54.9000675Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:54.9002424Z 2025-05-07T20:03:54.9387571Z GLIBCXX_3.4 2025-05-07T20:03:54.9388572Z GLIBCXX_3.4.9 2025-05-07T20:03:54.9389153Z GLIBCXX_3.4.11 2025-05-07T20:03:54.9389738Z GLIBCXX_3.4.14 2025-05-07T20:03:54.9390303Z GLIBCXX_3.4.15 2025-05-07T20:03:54.9390886Z GLIBCXX_3.4.18 2025-05-07T20:03:54.9391380Z GLIBCXX_3.4.19 2025-05-07T20:03:54.9391579Z GLIBCXX_3.4.20 2025-05-07T20:03:54.9391765Z GLIBCXX_3.4.21 2025-05-07T20:03:54.9391890Z 2025-05-07T20:03:54.9391895Z 2025-05-07T20:03:54.9406348Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.2uSqgqKKB2.symbols.txt 2025-05-07T20:03:54.9407739Z 2025-05-07T20:03:54.9737382Z 2025-05-07T20:03:54.9763365Z [CHECK] Total Number of symbols: 6602 2025-05-07T20:03:54.9787223Z [CHECK] Number of fbgemm symbols: 4516 2025-05-07T20:03:54.9804331Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so > /tmp/tmp.yWLPvwgqzD.usymbols.txt 2025-05-07T20:03:54.9804843Z 2025-05-07T20:03:54.9844156Z 2025-05-07T20:03:54.9869089Z [CHECK] Listing out undefined symbols (472 total): 2025-05-07T20:03:54.9890458Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.9891431Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:54.9892059Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:54.9892383Z U __assert_fail@GLIBC_2.2.5 2025-05-07T20:03:54.9892737Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.9893142Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:54.9893539Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.9893934Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:54.9894315Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:54.9894682Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:54.9895057Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:54.9895415Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:54.9895739Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:54.9896037Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:54.9896350Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:54.9896663Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:54.9896980Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:54.9897296Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:54.9897608Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:54.9897930Z U __cxa_pure_virtual@CXXABI_1.3 2025-05-07T20:03:54.9898234Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:54.9898543Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:54.9898852Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:54.9899155Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:54.9899474Z U __once_proxy@GLIBCXX_3.4.11 2025-05-07T20:03:54.9899952Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:54.9900343Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:54.9900754Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:54.9901132Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:54.9901502Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:54.9901876Z U at::SplitUntil32Bit::begin() const 2025-05-07T20:03:54.9902305Z U at::SplitUntil32Bit::end() const 2025-05-07T20:03:54.9902644Z U at::SplitUntil32Bit::iterator::operator*() const 2025-05-07T20:03:54.9903059Z U at::SplitUntil32Bit::iterator::operator++() 2025-05-07T20:03:54.9903489Z U at::Tensor::index(std::initializer_list) const 2025-05-07T20:03:54.9904022Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:54.9904468Z U at::TensorIteratorBase::build(at::TensorIteratorConfig&) 2025-05-07T20:03:54.9904885Z U at::TensorIteratorBase::can_use_32bit_indexing() const 2025-05-07T20:03:54.9905301Z U at::TensorIteratorBase::data_ptr(long) const 2025-05-07T20:03:54.9905657Z U at::TensorIteratorBase::is_contiguous() const 2025-05-07T20:03:54.9906002Z U at::TensorIteratorBase::numel() const 2025-05-07T20:03:54.9906350Z U at::TensorIteratorBase::with_32bit_indexing() const 2025-05-07T20:03:54.9906803Z U at::TensorIteratorConfig::add_borrowed_input(at::TensorBase const&) 2025-05-07T20:03:54.9907322Z U at::TensorIteratorConfig::add_borrowed_output(at::TensorBase const&) 2025-05-07T20:03:54.9907734Z U at::TensorMaker::make_tensor() 2025-05-07T20:03:54.9908062Z U at::_ops::_is_all_true::call(at::Tensor const&) 2025-05-07T20:03:54.9908426Z U at::_ops::_unique::call(at::Tensor const&, bool, bool) 2025-05-07T20:03:54.9908894Z U at::_ops::add_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:54.9909435Z U at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:54.9909847Z U at::_ops::all::call(at::Tensor const&) 2025-05-07T20:03:54.9910386Z U at::_ops::baddbmm::call(at::Tensor const&, at::Tensor const&, at::Tensor const&, c10::Scalar const&, c10::Scalar const&) 2025-05-07T20:03:54.9911001Z U at::_ops::broadcast_to::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:54.9911457Z U at::_ops::cat::call(c10::IListRef const&, long) 2025-05-07T20:03:54.9911900Z U at::_ops::cat_out::call(c10::IListRef const&, long, at::Tensor&) 2025-05-07T20:03:54.9912355Z U at::_ops::clamp_max::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:54.9912963Z U at::_ops::clone::call(at::Tensor const&, std::optional) 2025-05-07T20:03:54.9913631Z U at::_ops::contiguous::call(at::Tensor const&, c10::MemoryFormat) 2025-05-07T20:03:54.9914100Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:54.9914615Z U at::_ops::cumsum::call(at::Tensor const&, long, std::optional) 2025-05-07T20:03:54.9915310Z U at::_ops::diff::call(at::Tensor const&, long, long, std::optional const&, std::optional const&) 2025-05-07T20:03:54.9915960Z U at::_ops::div_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:54.9916862Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.9918262Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.9919231Z U at::_ops::fill__Scalar::call(at::Tensor&, c10::Scalar const&) 2025-05-07T20:03:54.9919803Z U at::_ops::flatten_using_ints::call(at::Tensor const&, long, long) 2025-05-07T20:03:54.9920204Z U at::_ops::floor::call(at::Tensor const&) 2025-05-07T20:03:54.9920945Z U at::_ops::full::call(c10::ArrayRef, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.9921771Z U at::_ops::ge_Scalar::call(at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:54.9922376Z U at::_ops::index_put_::call(at::Tensor&, c10::List > const&, at::Tensor const&, bool) 2025-05-07T20:03:54.9922981Z U at::_ops::index_select::call(at::Tensor const&, long, at::Tensor const&) 2025-05-07T20:03:54.9923401Z U at::_ops::item::call(at::Tensor const&) 2025-05-07T20:03:54.9923814Z U at::_ops::le_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:54.9924193Z U at::_ops::max::call(at::Tensor const&) 2025-05-07T20:03:54.9924585Z U at::_ops::mul_Tensor::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:54.9925414Z U at::_ops::ones_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.9926223Z U at::_ops::permute::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:54.9927007Z U at::_ops::range::call(c10::Scalar const&, c10::Scalar const&, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.9927797Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:54.9928366Z U at::_ops::resize_::call(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:03:54.9928941Z U at::_ops::select_int::call(at::Tensor const&, long, c10::SymInt) 2025-05-07T20:03:54.9929645Z U at::_ops::set__source_Storage_storage_offset::call(at::Tensor&, c10::Storage, c10::SymInt, c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:54.9930510Z U at::_ops::slice_Tensor::call(at::Tensor const&, long, std::optional, std::optional, c10::SymInt) 2025-05-07T20:03:54.9931110Z U at::_ops::sort::call(at::Tensor const&, long, bool) 2025-05-07T20:03:54.9931960Z U at::_ops::split_sizes::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:03:54.9932463Z U at::_ops::squeeze_dim::call(at::Tensor const&, long) 2025-05-07T20:03:54.9932988Z U at::_ops::sub_Tensor::call(at::Tensor const&, at::Tensor const&, c10::Scalar const&) 2025-05-07T20:03:54.9933541Z U at::_ops::sum::call(at::Tensor const&, std::optional) 2025-05-07T20:03:54.9934137Z U at::_ops::tensor_split_indices::call(at::Tensor const&, c10::ArrayRef, long) 2025-05-07T20:03:54.9934843Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:54.9935921Z U at::_ops::to_dtype_layout::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, bool, bool, std::optional) 2025-05-07T20:03:54.9936871Z U at::_ops::transpose_int::call(at::Tensor const&, long, long) 2025-05-07T20:03:54.9937505Z U at::_ops::unique_consecutive::call(at::Tensor const&, bool, bool, std::optional) 2025-05-07T20:03:54.9938144Z U at::_ops::unsqueeze::call(at::Tensor const&, long) 2025-05-07T20:03:54.9938589Z U at::_ops::view::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:54.9939223Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:54.9939631Z U at::_ops::zero_::call(at::Tensor&) 2025-05-07T20:03:54.9940365Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.9941604Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.9942643Z U at::checkScalarTypes(char const*, at::TensorArg const&, c10::ArrayRef) 2025-05-07T20:03:54.9943178Z U at::cuda::getCurrentCUDABlasHandle() 2025-05-07T20:03:54.9943599Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:54.9943996Z U at::cuda::getDeviceProperties(signed char) 2025-05-07T20:03:54.9944392Z U at::cuda::get_p2p_access(signed char, signed char) 2025-05-07T20:03:54.9945017Z U at::detail::computeStorageNbytes(c10::ArrayRef, c10::ArrayRef, unsigned long, unsigned long) 2025-05-07T20:03:54.9945631Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:54.9946030Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:54.9946420Z U at::get_num_threads() 2025-05-07T20:03:54.9946730Z U at::get_thread_num() 2025-05-07T20:03:54.9947286Z U at::internal::OpaqueOptionalTensorRef::~OpaqueOptionalTensorRef() 2025-05-07T20:03:54.9947832Z U at::internal::set_thread_num(int) 2025-05-07T20:03:54.9948280Z U at::native::_rowwise_prune(at::Tensor const&, at::Tensor const&, c10::ScalarType) 2025-05-07T20:03:54.9949175Z U at::native::empty_like(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.9950416Z U at::native::empty_meta_symint(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:54.9951394Z U at::native::resize_(at::Tensor const&, c10::ArrayRef, std::optional) 2025-05-07T20:03:54.9951926Z U at::print(std::ostream&, at::Tensor const&, long) 2025-05-07T20:03:54.9952298Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:54.9952777Z U at::tensor(c10::ArrayRef, c10::TensorOptions const&) 2025-05-07T20:03:54.9953317Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:54.9953648Z U bool at::Tensor::item() const 2025-05-07T20:03:54.9954068Z U bool* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.9954474Z U bool* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.9954866Z U c10::AnyType::get() 2025-05-07T20:03:54.9955234Z U c10::AutogradMetaInterface::~AutogradMetaInterface() 2025-05-07T20:03:54.9955730Z U c10::BFloat16* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.9956331Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.9956746Z U c10::BoolType::get() 2025-05-07T20:03:54.9957062Z U c10::DeviceObjType::get() 2025-05-07T20:03:54.9957469Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:54.9957937Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:54.9958343Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:54.9959100Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:54.9960425Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:54.9961499Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:54.9962077Z U c10::Error::what() const 2025-05-07T20:03:54.9962368Z U c10::FloatType::get() 2025-05-07T20:03:54.9962648Z U c10::GradMode::is_enabled() 2025-05-07T20:03:54.9962948Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:54.9963313Z U c10::Half* at::TensorBase::data_ptr() const 2025-05-07T20:03:54.9963723Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:54.9964139Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:54.9964494Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:54.9964807Z U c10::IValue::isBoolList() const 2025-05-07T20:03:54.9965107Z U c10::IValue::isIntList() const 2025-05-07T20:03:54.9965414Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:54.9965722Z U c10::IValue::isTensorList() const 2025-05-07T20:03:54.9966066Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:54.9966414Z U c10::InferenceMode::is_enabled() 2025-05-07T20:03:54.9966708Z U c10::IntType::get() 2025-05-07T20:03:54.9967347Z U c10::ListType::get(std::__cxx11::basic_string, std::allocator > const&, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:54.9968049Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:54.9968429Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:54.9968753Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:54.9969078Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:54.9969501Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:54.9969930Z U c10::ParallelGuard::ParallelGuard(bool) 2025-05-07T20:03:54.9970266Z U c10::ParallelGuard::~ParallelGuard() 2025-05-07T20:03:54.9970581Z U c10::ScalarTypeType::get() 2025-05-07T20:03:54.9971035Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:54.9971697Z U c10::SmallVectorBase::mallocForGrow(unsigned long, unsigned long, unsigned long&) 2025-05-07T20:03:54.9972244Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:54.9972599Z U c10::StringType::get() 2025-05-07T20:03:54.9972923Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:54.9973287Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:54.9973663Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:54.9974280Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:54.9974893Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:54.9975270Z U c10::SymInt::operator%(c10::SymInt const&) const 2025-05-07T20:03:54.9975621Z U c10::SymInt::operator*(c10::SymInt const&) const 2025-05-07T20:03:54.9975976Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:54.9976301Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:54.9976635Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:54.9976967Z U c10::SymInt::sym_ne(c10::SymInt const&) const 2025-05-07T20:03:54.9977291Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:54.9977606Z U c10::SymIntType::get() 2025-05-07T20:03:54.9977934Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:54.9978290Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:54.9978948Z U c10::TensorImpl::set_autograd_meta(std::unique_ptr >) 2025-05-07T20:03:54.9979796Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:54.9980160Z U c10::TensorType::get() 2025-05-07T20:03:54.9981161Z U c10::TupleType::TupleType(std::vector, std::allocator > >, std::optional, std::shared_ptr) 2025-05-07T20:03:54.9982500Z U c10::Type::isSubtypeOfExt(c10::Type const&, std::ostream*) const 2025-05-07T20:03:54.9982925Z U c10::Type::is_module() const 2025-05-07T20:03:54.9983267Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:54.9984233Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:54.9985204Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:54.9985623Z U c10::cuda::CUDAKernelLaunchRegistry::get_singleton_ref() 2025-05-07T20:03:54.9986175Z U c10::cuda::CUDAKernelLaunchRegistry::get_uvm_assertions_ptr_for_current_device() 2025-05-07T20:03:54.9986895Z U c10::cuda::CUDAKernelLaunchRegistry::insert(char const*, char const*, unsigned int, char const*, int) 2025-05-07T20:03:54.9987463Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:54.9987800Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:54.9988154Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:54.9988503Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:54.9988841Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:54.9989323Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:54.9989796Z U c10::cuda::current_device() 2025-05-07T20:03:54.9990111Z U c10::cuda::device_count() 2025-05-07T20:03:54.9990452Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:54.9990838Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:54.9991233Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:54.9991623Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:54.9992030Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:54.9992417Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:54.9993173Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:54.9994317Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:54.9995206Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:54.9996084Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.9997046Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:54.9998096Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:54.9999111Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:54.9999813Z U c10::impl::ExcludeDispatchKeyGuard::ExcludeDispatchKeyGuard(c10::DispatchKeySet) 2025-05-07T20:03:55.0000397Z U c10::impl::ExcludeDispatchKeyGuard::~ExcludeDispatchKeyGuard() 2025-05-07T20:03:55.0000837Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:55.0001196Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:55.0001748Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:55.0002377Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:55.0002792Z U c10::impl::PyObjectSlot::PyObjectSlot() 2025-05-07T20:03:55.0003161Z U c10::impl::PyObjectSlot::~PyObjectSlot() 2025-05-07T20:03:55.0003541Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:55.0003969Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:55.0004374Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:55.0004712Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:55.0005098Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:55.0005739Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:55.0006362Z U c10::operator*(c10::SymInt const&, int) 2025-05-07T20:03:55.0006712Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:03:55.0007076Z U c10::operator+(c10::SymInt const&, unsigned long) 2025-05-07T20:03:55.0007458Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:55.0007822Z U c10::operator-(c10::SymInt const&, unsigned long) 2025-05-07T20:03:55.0008309Z U c10::operator/(c10::SymInt const&, int) 2025-05-07T20:03:55.0008642Z U c10::operator<(c10::SymInt const&, int) 2025-05-07T20:03:55.0009166Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:55.0009550Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:55.0009941Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:55.0010351Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:55.0010712Z U c10::operator==(c10::SymInt const&, int) 2025-05-07T20:03:55.0011052Z U c10::operator>(c10::SymInt const&, int) 2025-05-07T20:03:55.0011388Z U c10::operator>=(c10::SymInt const&, int) 2025-05-07T20:03:55.0011720Z U c10::report_overflow(char const*) 2025-05-07T20:03:55.0012044Z U c10::throwNullDataPtrError() 2025-05-07T20:03:55.0012366Z U c10::typeKindToString(c10::TypeKind) 2025-05-07T20:03:55.0012687Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:55.0013030Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:55.0013442Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:55.0013863Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:55.0014184Z U cublasGemmStridedBatchedEx 2025-05-07T20:03:55.0014497Z U cublasSetStream_v2 2025-05-07T20:03:55.0014803Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:55.0015172Z U cudaDeviceGetByPCIBusId@libcudart.so.11.0 2025-05-07T20:03:55.0015527Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.0017372Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:55.0017723Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:55.0018067Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:55.0018443Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:55.0018766Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:55.0019114Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.0019445Z U cudaFree@libcudart.so.11.0 2025-05-07T20:03:55.0019805Z U cudaFuncGetAttributes@libcudart.so.11.0 2025-05-07T20:03:55.0020155Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:55.0020491Z U cudaGetDevice@libcudart.so.11.0 2025-05-07T20:03:55.0020835Z U cudaGetDeviceCount@libcudart.so.11.0 2025-05-07T20:03:55.0021186Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:55.0021549Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:55.0021883Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:55.0022236Z U cudaHostGetDevicePointer@libcudart.so.11.0 2025-05-07T20:03:55.0022587Z U cudaHostRegister@libcudart.so.11.0 2025-05-07T20:03:55.0022928Z U cudaHostUnregister@libcudart.so.11.0 2025-05-07T20:03:55.0023261Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:55.0023594Z U cudaMallocManaged@libcudart.so.11.0 2025-05-07T20:03:55.0023931Z U cudaMemAdvise@libcudart.so.11.0 2025-05-07T20:03:55.0024267Z U cudaMemPrefetchAsync@libcudart.so.11.0 2025-05-07T20:03:55.0024614Z U cudaMemcpy2DAsync@libcudart.so.11.0 2025-05-07T20:03:55.0024940Z U cudaMemsetAsync@libcudart.so.11.0 2025-05-07T20:03:55.0025443Z U cudaOccupancyMaxActiveBlocksPerMultiprocessorWithFlags@libcudart.so.11.0 2025-05-07T20:03:55.0025963Z U cudaPeekAtLastError@libcudart.so.11.0 2025-05-07T20:03:55.0026297Z U cudaSetDevice@libcudart.so.11.0 2025-05-07T20:03:55.0026626Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:55.0026964Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.0027318Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:55.0027680Z U double* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.0028090Z U double* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.0028451Z U exit@GLIBC_2.2.5 2025-05-07T20:03:55.0028719Z U exp10@GLIBC_2.2.5 2025-05-07T20:03:55.0028989Z U exp2@GLIBC_2.2.5 2025-05-07T20:03:55.0029241Z U exp@GLIBC_2.2.5 2025-05-07T20:03:55.0029503Z U expf@GLIBC_2.2.5 2025-05-07T20:03:55.0029864Z U fbgemm_gpu::asynchronous_complete_cumsum_cpu(at::Tensor const&) 2025-05-07T20:03:55.0030362Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:55.0030851Z U fbgemm_gpu::asynchronous_exclusive_cumsum_cpu(at::Tensor const&) 2025-05-07T20:03:55.0031704Z U fbgemm_gpu::asynchronous_exclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:55.0032335Z U fbgemm_gpu::asynchronous_inclusive_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:55.0032872Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.0033285Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.0033639Z U fmod@GLIBC_2.2.5 2025-05-07T20:03:55.0033912Z U free@GLIBC_2.2.5 2025-05-07T20:03:55.0034199Z U get_info_B_num_bits_from_T(int, int) 2025-05-07T20:03:55.0034540Z U int at::Tensor::item() const 2025-05-07T20:03:55.0034959Z U int const* at::TensorBase::const_data_ptr() const 2025-05-07T20:03:55.0035353Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.0035736Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.0036099Z U isnan@GLIBC_2.2.5 2025-05-07T20:03:55.0036376Z U lgamma@GLIBC_2.2.5 2025-05-07T20:03:55.0036650Z U llrint@GLIBC_2.2.5 2025-05-07T20:03:55.0036933Z U llround@GLIBC_2.2.5 2025-05-07T20:03:55.0037214Z U log10@GLIBC_2.2.5 2025-05-07T20:03:55.0037507Z U log2@GLIBC_2.2.5 2025-05-07T20:03:55.0037779Z U log@GLIBC_2.2.5 2025-05-07T20:03:55.0038040Z U logl@GLIBC_2.2.5 2025-05-07T20:03:55.0038334Z U long at::Tensor::item() const 2025-05-07T20:03:55.0038725Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:55.0039193Z U long const* at::TensorBase::const_data_ptr() const 2025-05-07T20:03:55.0039609Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.0039996Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.0040350Z U lrint@GLIBC_2.2.5 2025-05-07T20:03:55.0040623Z U madvise@GLIBC_2.2.5 2025-05-07T20:03:55.0040909Z U malloc@GLIBC_2.2.5 2025-05-07T20:03:55.0041177Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:55.0041454Z U memcpy@GLIBC_2.14 2025-05-07T20:03:55.0041722Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:55.0041999Z U memset@GLIBC_2.2.5 2025-05-07T20:03:55.0042283Z U nextafter@GLIBC_2.2.5 2025-05-07T20:03:55.0042587Z U nvmlDeviceGetCount_v2 2025-05-07T20:03:55.0042909Z U nvmlDeviceGetHandleByIndex_v2 2025-05-07T20:03:55.0043259Z U nvmlDeviceGetNvLinkRemotePciInfo_v2 2025-05-07T20:03:55.0043611Z U nvmlDeviceGetNvLinkState 2025-05-07T20:03:55.0043919Z U nvmlDeviceGetPciInfo_v3 2025-05-07T20:03:55.0044225Z U nvmlInit_v2 2025-05-07T20:03:55.0044499Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:55.0044853Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:55.0045334Z U operator new[](unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:55.0045650Z U pow@GLIBC_2.2.5 2025-05-07T20:03:55.0045928Z U printf@GLIBC_2.2.5 2025-05-07T20:03:55.0046198Z U puts@GLIBC_2.2.5 2025-05-07T20:03:55.0046472Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:55.0046800Z U short* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.0047270Z U signed char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.0047671Z U sin@GLIBC_2.2.5 2025-05-07T20:03:55.0048052Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:55.0048548Z U std::_Rb_tree_decrement(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:55.0049005Z U std::_Rb_tree_increment(std::_Rb_tree_node_base const*)@GLIBCXX_3.4 2025-05-07T20:03:55.0049511Z U std::_Rb_tree_increment(std::_Rb_tree_node_base*)@GLIBCXX_3.4 2025-05-07T20:03:55.0050172Z U std::_Rb_tree_insert_and_rebalance(bool, std::_Rb_tree_node_base*, std::_Rb_tree_node_base*, std::_Rb_tree_node_base&)@GLIBCXX_3.4 2025-05-07T20:03:55.0050985Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:55.0051820Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.0052720Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0053780Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.0054834Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0055783Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0056732Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:55.0057802Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0058959Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0059931Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:55.0060742Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.0061599Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:55.0062179Z U std::__once_call@GLIBCXX_3.4.11 2025-05-07T20:03:55.0062505Z U std::__once_callable@GLIBCXX_3.4.11 2025-05-07T20:03:55.0062833Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:55.0063160Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:55.0063483Z U std::__throw_bad_cast()@GLIBCXX_3.4 2025-05-07T20:03:55.0063823Z U std::__throw_bad_function_call()@GLIBCXX_3.4.14 2025-05-07T20:03:55.0064216Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.0064589Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.0064968Z U std::__throw_out_of_range(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.0065373Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:55.0065787Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:55.0066162Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:55.0066603Z U std::basic_filebuf >::close()@GLIBCXX_3.4 2025-05-07T20:03:55.0067260Z U std::basic_ifstream >::basic_ifstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:03:55.0067943Z U std::basic_ifstream >::~basic_ifstream()@GLIBCXX_3.4 2025-05-07T20:03:55.0068545Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:55.0069230Z U std::basic_ofstream >::basic_ofstream(char const*, std::_Ios_Openmode)@GLIBCXX_3.4 2025-05-07T20:03:55.0069903Z U std::basic_ofstream >::~basic_ofstream()@GLIBCXX_3.4 2025-05-07T20:03:55.0070829Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.0071654Z U std::chrono::_V2::system_clock::now()@GLIBCXX_3.4.19 2025-05-07T20:03:55.0072031Z U std::cout@GLIBCXX_3.4 2025-05-07T20:03:55.0072381Z U std::ctype::_M_widen_init() const@GLIBCXX_3.4.11 2025-05-07T20:03:55.0072848Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:55.0073419Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:55.0073785Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:55.0074129Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:55.0074469Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:55.0074919Z U std::ostream& std::ostream::_M_insert(double)@GLIBCXX_3.4.9 2025-05-07T20:03:55.0075423Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.0075962Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.0076423Z U std::ostream::flush()@GLIBCXX_3.4 2025-05-07T20:03:55.0076768Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:55.0077108Z U std::ostream::put(char)@GLIBCXX_3.4 2025-05-07T20:03:55.0077481Z U std::ostream::write(char const*, long)@GLIBCXX_3.4 2025-05-07T20:03:55.0077918Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0078324Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:55.0078749Z U std::runtime_error::runtime_error(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0079485Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0079633Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:55.0079741Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:55.0079841Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:55.0079948Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:55.0080039Z U sysconf@GLIBC_2.2.5 2025-05-07T20:03:55.0080159Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:55.0080786Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:55.0081257Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.0081763Z U torch::Library::_def(std::variant&&, torch::CppFunction&&, std::vector > const&) & 2025-05-07T20:03:55.0082036Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.0082160Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:55.0082459Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:55.0082654Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:55.0082884Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:55.0083070Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:55.0083430Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:55.0083578Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:55.0083765Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:55.0083953Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:55.0084102Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:55.0084216Z U torch::autograd::Node::metadata() 2025-05-07T20:03:55.0084354Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:55.0084632Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:55.0084906Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:55.0085055Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:55.0085413Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:55.0085618Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:55.0088162Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:55.0088320Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:55.0088457Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:55.0088606Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:55.0088755Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:55.0089141Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:55.0089487Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:55.0089871Z U torch::jit::parseSchemaOrName(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:55.0090063Z U torch::pickle_load(std::vector > const&) 2025-05-07T20:03:55.0090173Z U torch::pickle_save(c10::IValue const&) 2025-05-07T20:03:55.0090696Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:55.0090795Z U typeinfo for c10::Error 2025-05-07T20:03:55.0090884Z U typeinfo for c10::Type 2025-05-07T20:03:55.0091021Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:55.0091138Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:55.0091279Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:55.0091406Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:55.0091513Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:55.0091687Z U unsigned char* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.0091890Z U unsigned char* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.0092314Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(float const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:55.0092829Z U void fbgemm::FloatOrHalfToFused8BitRowwiseQuantizedSBFloat(unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:55.0093265Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, float const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:55.0093791Z U void fbgemm::FloatOrHalfToFusedNBitRowwiseQuantizedSBHalf(int, unsigned short const*, unsigned long, int, unsigned char*) 2025-05-07T20:03:55.0094231Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, float*) 2025-05-07T20:03:55.0094729Z U void fbgemm::Fused8BitRowwiseQuantizedSBFloatToFloatOrHalf(unsigned char const*, unsigned long, int, unsigned short*) 2025-05-07T20:03:55.0095173Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:03:55.0095698Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalf(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:03:55.0096185Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, float*, bool) 2025-05-07T20:03:55.0096748Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:03:55.0097314Z U void fbgemm::FusedNBitRowwiseQuantizedSBHalfToFloatOrHalfRef(int, unsigned char const*, unsigned long, int, unsigned short*, bool) 2025-05-07T20:03:55.0097462Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:55.0097614Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:55.0097782Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:55.0097926Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:55.0098072Z U vtable for __cxxabiv1::__vmi_class_type_info@CXXABI_1.3 2025-05-07T20:03:55.0098184Z U vtable for at::TensorIterator 2025-05-07T20:03:55.0098295Z U vtable for at::TensorIteratorBase 2025-05-07T20:03:55.0098385Z U vtable for c10::Error 2025-05-07T20:03:55.0098483Z U vtable for c10::ListType 2025-05-07T20:03:55.0098800Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.0098924Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:55.0099144Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:55.0099259Z U vtable for torch::autograd::AutogradMeta 2025-05-07T20:03:55.0099360Z U vtable for torch::autograd::Node 2025-05-07T20:03:55.0099530Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:55.0099630Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:55.0099732Z w _ITM_registerTMCloneTable 2025-05-07T20:03:55.0099857Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:55.0099954Z w __gmon_start__ 2025-05-07T20:03:55.0100042Z w __pthread_key_create 2025-05-07T20:03:55.0100144Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:55.0100259Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:55.0100342Z w pthread_once 2025-05-07T20:03:55.0100479Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:55.0100641Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:55.0100654Z 2025-05-07T20:03:55.0100780Z linux-vdso.so.1 (0x00007ffcf45c5000) 2025-05-07T20:03:55.0100863Z libc10.so => not found 2025-05-07T20:03:55.0100953Z libc10_cuda.so => not found 2025-05-07T20:03:55.0101313Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007fddfba00000) 2025-05-07T20:03:55.0101451Z libnvidia-ml.so.1 => not found 2025-05-07T20:03:55.0101538Z libtorch.so => not found 2025-05-07T20:03:55.0102075Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fddfec09000) 2025-05-07T20:03:55.0102533Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fddfb000000) 2025-05-07T20:03:55.0102625Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0102723Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0102809Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.0102958Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fddfad9c000) 2025-05-07T20:03:55.0103073Z libm.so.6 => /lib64/libm.so.6 (0x00007fddfb925000) 2025-05-07T20:03:55.0103215Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fddfebd9000) 2025-05-07T20:03:55.0103328Z libc.so.6 => /lib64/libc.so.6 (0x00007fddfab94000) 2025-05-07T20:03:55.0103444Z /lib64/ld-linux-x86-64.so.2 (0x00007fddfed07000) 2025-05-07T20:03:55.0103529Z libc10.so => not found 2025-05-07T20:03:55.0103866Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007fddfbf88000) 2025-05-07T20:03:55.0103950Z libtorch.so => not found 2025-05-07T20:03:55.0104045Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0104129Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0104205Z libc10.so => not found 2025-05-07T20:03:55.0104285Z libc10_cuda.so => not found 2025-05-07T20:03:55.0104374Z libtorch.so => not found 2025-05-07T20:03:55.0104459Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0104542Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0104638Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.0104777Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fddfb8cf000) 2025-05-07T20:03:55.0104861Z libtorch.so => not found 2025-05-07T20:03:55.0104935Z libc10.so => not found 2025-05-07T20:03:55.0105028Z libc10_cuda.so => not found 2025-05-07T20:03:55.0105111Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0105204Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0105302Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.0105383Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0105466Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0105547Z libtorch.so => not found 2025-05-07T20:03:55.0105678Z librt.so.1 => /lib64/librt.so.1 (0x00007fddfebcc000) 2025-05-07T20:03:55.0105836Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fddfebc7000) 2025-05-07T20:03:55.0105841Z 2025-05-07T20:03:55.0106096Z [CHECK] Displaying ELF information: 2025-05-07T20:03:55.0106287Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so 2025-05-07T20:03:55.0106293Z 2025-05-07T20:03:55.0106330Z 2025-05-07T20:03:55.0106477Z Dynamic section at offset 0x27457c0 contains 42 entries: 2025-05-07T20:03:55.0106581Z Tag Type Name/Value 2025-05-07T20:03:55.0106774Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:55.0106988Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:55.0107162Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm.so] 2025-05-07T20:03:55.0107356Z 0x0000000000000001 (NEEDED) Shared library: [libnvidia-ml.so.1] 2025-05-07T20:03:55.0107534Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:55.0107766Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_sparse_async_cumsum.so] 2025-05-07T20:03:55.0107977Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_utils.so] 2025-05-07T20:03:55.0108161Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:55.0108375Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:55.0108568Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:55.0108784Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:55.0108957Z 0x0000000000000001 (NEEDED) Shared library: [libm.so.6] 2025-05-07T20:03:55.0109133Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:55.0109342Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:55.0109537Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:55.0109722Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_py.so] 2025-05-07T20:03:55.0109895Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:55.0109997Z 0x000000000000000c (INIT) 0x1b0000 2025-05-07T20:03:55.0110099Z 0x000000000000000d (FINI) 0x73d51c 2025-05-07T20:03:55.0110213Z 0x0000000000000019 (INIT_ARRAY) 0x27387d0 2025-05-07T20:03:55.0110334Z 0x000000000000001b (INIT_ARRAYSZ) 1160 (bytes) 2025-05-07T20:03:55.0110442Z 0x000000000000001a (FINI_ARRAY) 0x2738c58 2025-05-07T20:03:55.0110548Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:55.0110659Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:55.0110758Z 0x0000000000000005 (STRTAB) 0x2fcd8 2025-05-07T20:03:55.0110857Z 0x0000000000000006 (SYMTAB) 0x91d0 2025-05-07T20:03:55.0110995Z 0x000000000000000a (STRSZ) 1264098 (bytes) 2025-05-07T20:03:55.0111102Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:55.0111207Z 0x0000000000000003 (PLTGOT) 0x2746aa0 2025-05-07T20:03:55.0111337Z 0x0000000000000002 (PLTRELSZ) 68832 (bytes) 2025-05-07T20:03:55.0111434Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:55.0111534Z 0x0000000000000017 (JMPREL) 0x19e3c8 2025-05-07T20:03:55.0111633Z 0x0000000000000007 (RELA) 0x167bd0 2025-05-07T20:03:55.0111767Z 0x0000000000000008 (RELASZ) 223224 (bytes) 2025-05-07T20:03:55.0111874Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:55.0111967Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:55.0112092Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:55.0112200Z 0x000000006ffffffe (VERNEED) 0x167a50 2025-05-07T20:03:55.0112297Z 0x000000006fffffff (VERNEEDNUM) 6 2025-05-07T20:03:55.0112409Z 0x000000006ffffff0 (VERSYM) 0x1646ba 2025-05-07T20:03:55.0112509Z 0x000000006ffffff9 (RELACOUNT) 2456 2025-05-07T20:03:55.0112596Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:55.0112601Z 2025-05-07T20:03:55.0112790Z ################################################################################ 2025-05-07T20:03:55.0112807Z 2025-05-07T20:03:55.0112811Z 2025-05-07T20:03:55.0112914Z ################################################################################ 2025-05-07T20:03:55.0113400Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:55.0113502Z [CHECK] Listing out library size: 2025-05-07T20:03:55.0113856Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:55.0113862Z 2025-05-07T20:03:55.0114110Z 76 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:55.0114114Z 2025-05-07T20:03:55.0114570Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:55.0115117Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.0115149Z 2025-05-07T20:03:55.0268521Z GLIBC_2.2.5 2025-05-07T20:03:55.0268947Z GLIBC_2.3 2025-05-07T20:03:55.0270170Z GLIBC_2.14 2025-05-07T20:03:55.0270203Z 2025-05-07T20:03:55.0270216Z 2025-05-07T20:03:55.0271984Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:55.0274046Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.0274053Z 2025-05-07T20:03:55.0539895Z GLIBCXX_3.4 2025-05-07T20:03:55.0540176Z GLIBCXX_3.4.9 2025-05-07T20:03:55.0540457Z GLIBCXX_3.4.11 2025-05-07T20:03:55.0540681Z GLIBCXX_3.4.18 2025-05-07T20:03:55.0540907Z GLIBCXX_3.4.20 2025-05-07T20:03:55.0541126Z GLIBCXX_3.4.21 2025-05-07T20:03:55.0541143Z 2025-05-07T20:03:55.0541157Z 2025-05-07T20:03:55.0556850Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.ijzGqkHBYA.symbols.txt 2025-05-07T20:03:55.0556919Z 2025-05-07T20:03:55.0793686Z 2025-05-07T20:03:55.0819478Z [CHECK] Total Number of symbols: 1609 2025-05-07T20:03:55.0843250Z [CHECK] Number of fbgemm symbols: 227 2025-05-07T20:03:55.0859073Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so > /tmp/tmp.8Q81MEZt4v.usymbols.txt 2025-05-07T20:03:55.0859101Z 2025-05-07T20:03:55.0878840Z 2025-05-07T20:03:55.0904808Z [CHECK] Listing out undefined symbols (176 total): 2025-05-07T20:03:55.0917726Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.0918908Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.0919203Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:55.0919648Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.0920082Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.0920470Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.0920882Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:55.0921278Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:55.0921627Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:55.0922024Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.0922324Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:55.0922664Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:55.0922767Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:55.0922873Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:55.0922977Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:55.0923085Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:55.0923200Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:55.0923324Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:55.0923501Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:55.0923654Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:55.0924019Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:55.0924182Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:55.0924797Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.0925460Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.0925826Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:55.0926124Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:55.0926623Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.0927218Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.0927373Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:55.0927498Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:55.0927592Z U at::globalContext() 2025-05-07T20:03:55.0927800Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.0927890Z U c10::BoolType::get() 2025-05-07T20:03:55.0928043Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:55.0928149Z U c10::FloatType::get() 2025-05-07T20:03:55.0928257Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:55.0928419Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.0928562Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:55.0928654Z U c10::IntType::get() 2025-05-07T20:03:55.0928809Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:55.0928926Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:55.0929074Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:55.0929205Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:55.0929349Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:55.0929735Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:55.0929864Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:55.0929983Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:55.0930097Z U c10::SymIntType::get() 2025-05-07T20:03:55.0930252Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:55.0930413Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:55.0930514Z U c10::TensorType::get() 2025-05-07T20:03:55.0930640Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:55.0931772Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:55.0931913Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:55.0932064Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:55.0932253Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:55.0932369Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:55.0932492Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:55.0932606Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:55.0932877Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:55.0932987Z U c10::cuda::device_count() 2025-05-07T20:03:55.0933124Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:55.0933311Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:55.0933454Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:55.0933595Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:55.0933806Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:55.0933922Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:55.0934488Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:55.0934756Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:55.0935269Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.0935624Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:55.0936222Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.0936347Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:55.0936467Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:55.0936619Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:55.0936791Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:55.0936924Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:55.0937072Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:55.0937210Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:55.0937331Z U c10::throwNullDataPtrError() 2025-05-07T20:03:55.0937451Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:55.0937564Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:55.0937762Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:55.0937897Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:55.0938150Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:55.0938272Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.0938417Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:55.0938530Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:55.0938647Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:55.0938765Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:55.0938877Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:55.0938998Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.0939115Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:55.0939261Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:55.0939377Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:55.0939491Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:55.0939705Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:55.0939817Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:55.0939941Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.0940069Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:55.0942147Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:03:55.0942391Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:55.0942503Z U float at::Tensor::item() const 2025-05-07T20:03:55.0942636Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.0942813Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.0942934Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.0943066Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.0943230Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:55.0943363Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.0943501Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.0943597Z U memcpy@GLIBC_2.14 2025-05-07T20:03:55.0943700Z U memset@GLIBC_2.2.5 2025-05-07T20:03:55.0943808Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:55.0943919Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:55.0944239Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:55.0944527Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:55.0944832Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:55.0945142Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:55.0945464Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:55.0945839Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.0946228Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0946734Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.0947113Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0947499Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0947925Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:55.0948447Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.0948763Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:55.0949120Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:55.0949246Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:55.0949373Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:55.0949506Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.0949646Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.0949837Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:55.0949963Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:55.0950203Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:55.0950776Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.0950890Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:55.0951013Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:55.0951126Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:55.0951234Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:55.0951419Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.0951647Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.0951767Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:55.0951882Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:55.0951975Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:55.0952097Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:55.0952761Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:55.0953396Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.0953666Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.0954059Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:55.0954630Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:55.0956124Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:55.0957629Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:55.0959043Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:55.0960490Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:55.0962969Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:55.0964268Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:55.0966139Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:55.0968273Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:55.0970410Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:55.0972394Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:55.0974383Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:55.0976344Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:55.0978209Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:03:55.0978385Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:55.0978559Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:55.0978720Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:55.0979080Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.0979310Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:55.0979423Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:55.0979543Z w _ITM_registerTMCloneTable 2025-05-07T20:03:55.0979650Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:55.0979743Z w __gmon_start__ 2025-05-07T20:03:55.0979840Z w __pthread_key_create 2025-05-07T20:03:55.0979967Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:55.0980081Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:55.0980231Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:55.0980506Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:55.0980516Z 2025-05-07T20:03:55.0980651Z linux-vdso.so.1 (0x00007ffdb77fb000) 2025-05-07T20:03:55.0980749Z libc10.so => not found 2025-05-07T20:03:55.0980866Z libc10_cuda.so => not found 2025-05-07T20:03:55.0981455Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f2f18a00000) 2025-05-07T20:03:55.0981553Z libtorch.so => not found 2025-05-07T20:03:55.0981667Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0981766Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0981862Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.0982044Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f2f1879c000) 2025-05-07T20:03:55.0982193Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f2f3db96000) 2025-05-07T20:03:55.0982313Z libc.so.6 => /lib64/libc.so.6 (0x00007f2f18594000) 2025-05-07T20:03:55.0982444Z /lib64/ld-linux-x86-64.so.2 (0x00007f2f3dbca000) 2025-05-07T20:03:55.0982556Z libc10.so => not found 2025-05-07T20:03:55.0982655Z libc10_cuda.so => not found 2025-05-07T20:03:55.0983166Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f2f1839e000) 2025-05-07T20:03:55.0983747Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f2f3da9c000) 2025-05-07T20:03:55.0983846Z libtorch.so => not found 2025-05-07T20:03:55.0984325Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f2f17e00000) 2025-05-07T20:03:55.0984776Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f2f17400000) 2025-05-07T20:03:55.0984899Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0984993Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0985089Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.0985227Z libm.so.6 => /lib64/libm.so.6 (0x00007f2f17d25000) 2025-05-07T20:03:55.0985342Z libc10.so => not found 2025-05-07T20:03:55.0985432Z libc10_cuda.so => not found 2025-05-07T20:03:55.0985873Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f2f3da8d000) 2025-05-07T20:03:55.0985963Z libtorch.so => not found 2025-05-07T20:03:55.0986085Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0986196Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0986290Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.0986441Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f2f3da35000) 2025-05-07T20:03:55.0986528Z libc10.so => not found 2025-05-07T20:03:55.0986638Z libc10_cuda.so => not found 2025-05-07T20:03:55.0986729Z libtorch.so => not found 2025-05-07T20:03:55.0986823Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0986932Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0987028Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.0987120Z libc10.so => not found 2025-05-07T20:03:55.0987464Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f2f38b88000) 2025-05-07T20:03:55.0987576Z libtorch.so => not found 2025-05-07T20:03:55.0987675Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0987769Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0987879Z libtorch.so => not found 2025-05-07T20:03:55.0987965Z libc10.so => not found 2025-05-07T20:03:55.0988058Z libc10_cuda.so => not found 2025-05-07T20:03:55.0988153Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0988264Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0988367Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.0988463Z libtorch.so => not found 2025-05-07T20:03:55.0988581Z libc10.so => not found 2025-05-07T20:03:55.0988685Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0988784Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0988963Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f2f3da28000) 2025-05-07T20:03:55.0989092Z libtorch_cpu.so => not found 2025-05-07T20:03:55.0989200Z libtorch_cuda.so => not found 2025-05-07T20:03:55.0989297Z libtorch.so => not found 2025-05-07T20:03:55.0989457Z librt.so.1 => /lib64/librt.so.1 (0x00007f2f3da21000) 2025-05-07T20:03:55.0989462Z 2025-05-07T20:03:55.0989566Z [CHECK] Displaying ELF information: 2025-05-07T20:03:55.0989848Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_gwd.so 2025-05-07T20:03:55.0989853Z 2025-05-07T20:03:55.1004220Z 2025-05-07T20:03:55.1005406Z Dynamic section at offset 0x4b7dd08 contains 38 entries: 2025-05-07T20:03:55.1005786Z Tag Type Name/Value 2025-05-07T20:03:55.1006369Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:55.1007024Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:55.1007808Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:55.1008392Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:55.1009271Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:55.1009890Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:55.1010507Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:55.1011115Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:55.1011685Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:55.1012240Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:55.1012894Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:55.1013556Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_gwd.so] 2025-05-07T20:03:55.1013749Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:55.1013929Z 0x000000000000000c (INIT) 0xac000 2025-05-07T20:03:55.1014073Z 0x000000000000000d (FINI) 0x5df4cc 2025-05-07T20:03:55.1014203Z 0x0000000000000019 (INIT_ARRAY) 0x4b7d9f8 2025-05-07T20:03:55.1014339Z 0x000000000000001b (INIT_ARRAYSZ) 200 (bytes) 2025-05-07T20:03:55.1014491Z 0x000000000000001a (FINI_ARRAY) 0x4b7dac0 2025-05-07T20:03:55.1014665Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:55.1014785Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:55.1014912Z 0x0000000000000005 (STRTAB) 0xc368 2025-05-07T20:03:55.1015027Z 0x0000000000000006 (SYMTAB) 0x2c78 2025-05-07T20:03:55.1015170Z 0x000000000000000a (STRSZ) 595540 (bytes) 2025-05-07T20:03:55.1015291Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:55.1015426Z 0x0000000000000003 (PLTGOT) 0x4b7efa8 2025-05-07T20:03:55.1015561Z 0x0000000000000002 (PLTRELSZ) 12672 (bytes) 2025-05-07T20:03:55.1015671Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:55.1015795Z 0x0000000000000017 (JMPREL) 0xa7fe0 2025-05-07T20:03:55.1015905Z 0x0000000000000007 (RELA) 0x9e770 2025-05-07T20:03:55.1016034Z 0x0000000000000008 (RELASZ) 39024 (bytes) 2025-05-07T20:03:55.1016158Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:55.1016275Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:55.1016401Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:55.1016518Z 0x000000006ffffffe (VERNEED) 0x9e650 2025-05-07T20:03:55.1016638Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:55.1016751Z 0x000000006ffffff0 (VERSYM) 0x9d9bc 2025-05-07T20:03:55.1016866Z 0x000000006ffffff9 (RELACOUNT) 239 2025-05-07T20:03:55.1016977Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:55.1017005Z 2025-05-07T20:03:55.1017121Z ################################################################################ 2025-05-07T20:03:55.1017129Z 2025-05-07T20:03:55.1017133Z 2025-05-07T20:03:55.1017249Z ################################################################################ 2025-05-07T20:03:55.1017610Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:55.1017716Z [CHECK] Listing out library size: 2025-05-07T20:03:55.1018156Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:55.1018160Z 2025-05-07T20:03:55.1021755Z 31 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:55.1021876Z 2025-05-07T20:03:55.1041043Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:55.1041890Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.1041906Z 2025-05-07T20:03:55.1186076Z GLIBC_2.2.5 2025-05-07T20:03:55.1186732Z GLIBC_2.3 2025-05-07T20:03:55.1187011Z GLIBC_2.14 2025-05-07T20:03:55.1187031Z 2025-05-07T20:03:55.1187044Z 2025-05-07T20:03:55.1188512Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:55.1190292Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.1190307Z 2025-05-07T20:03:55.1341991Z GLIBCXX_3.4 2025-05-07T20:03:55.1342626Z GLIBCXX_3.4.9 2025-05-07T20:03:55.1342970Z GLIBCXX_3.4.11 2025-05-07T20:03:55.1343069Z GLIBCXX_3.4.15 2025-05-07T20:03:55.1343196Z GLIBCXX_3.4.18 2025-05-07T20:03:55.1343284Z GLIBCXX_3.4.20 2025-05-07T20:03:55.1343372Z GLIBCXX_3.4.21 2025-05-07T20:03:55.1343380Z 2025-05-07T20:03:55.1343444Z 2025-05-07T20:03:55.1361288Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.qkoAIq5raX.symbols.txt 2025-05-07T20:03:55.1361335Z 2025-05-07T20:03:55.1483458Z 2025-05-07T20:03:55.1510438Z [CHECK] Total Number of symbols: 1857 2025-05-07T20:03:55.1534894Z [CHECK] Number of fbgemm symbols: 100 2025-05-07T20:03:55.1556446Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so > /tmp/tmp.tfGMeQh2GI.usymbols.txt 2025-05-07T20:03:55.1556497Z 2025-05-07T20:03:55.1580025Z 2025-05-07T20:03:55.1604775Z [CHECK] Listing out undefined symbols (267 total): 2025-05-07T20:03:55.1622737Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.1623149Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.1623344Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:55.1623532Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.1623791Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.1623974Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.1624140Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:55.1624286Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:55.1624430Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:55.1624609Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.1624737Z U __cxa_allocate_exception@CXXABI_1.3 2025-05-07T20:03:55.1624868Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:55.1625000Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:55.1625122Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:55.1625240Z U __cxa_free_exception@CXXABI_1.3 2025-05-07T20:03:55.1625374Z U __cxa_guard_abort@CXXABI_1.3 2025-05-07T20:03:55.1625496Z U __cxa_guard_acquire@CXXABI_1.3 2025-05-07T20:03:55.1625609Z U __cxa_guard_release@CXXABI_1.3 2025-05-07T20:03:55.1625716Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:55.1625854Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:55.1625960Z U __cxa_throw@CXXABI_1.3 2025-05-07T20:03:55.1626079Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:55.1626209Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:55.1626330Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:55.1626484Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:55.1626689Z U at::RecordFunction::RecordFunction(at::StepCallbacks&&) 2025-05-07T20:03:55.1626823Z U at::RecordFunction::currentThreadId() 2025-05-07T20:03:55.1626945Z U at::RecordFunction::end() 2025-05-07T20:03:55.1627077Z U at::RecordFunction::~RecordFunction() 2025-05-07T20:03:55.1627445Z U at::SavedTensorDefaultHooks::set_tracing(bool) 2025-05-07T20:03:55.1627660Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:55.1627833Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:55.1628449Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.1629101Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.1629324Z U at::_ops::view_as::call(at::Tensor const&, at::Tensor const&) 2025-05-07T20:03:55.1629862Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.1630480Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.1630621Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:55.1630748Z U at::functorch::functorchTLSAccessor() 2025-05-07T20:03:55.1630909Z U at::getStepCallbacksUnlessEmpty(at::RecordScope) 2025-05-07T20:03:55.1631018Z U at::globalContext() 2025-05-07T20:03:55.1631142Z U at::sequence_number::get_and_increment() 2025-05-07T20:03:55.1631465Z U bcmp@GLIBC_2.2.5 2025-05-07T20:03:55.1631572Z U c10::AnyType::get() 2025-05-07T20:03:55.1631784Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.1631884Z U c10::BoolType::get() 2025-05-07T20:03:55.1632054Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:55.1632237Z U c10::Dispatcher::findSchemaOrThrow(char const*, char const*) 2025-05-07T20:03:55.1632351Z U c10::Dispatcher::realSingleton() 2025-05-07T20:03:55.1632985Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet) 2025-05-07T20:03:55.1633633Z U c10::Dispatcher::runRecordFunction(at::RecordFunction&, std::reference_wrapper, c10::DispatchKey, c10::DispatchKeySet, c10::ArrayRef) 2025-05-07T20:03:55.1634019Z U c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:55.1634135Z U c10::Error::what() const 2025-05-07T20:03:55.1634233Z U c10::FloatType::get() 2025-05-07T20:03:55.1634339Z U c10::GradMode::is_enabled() 2025-05-07T20:03:55.1634461Z U c10::GradMode::set_enabled(bool) 2025-05-07T20:03:55.1634698Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.1634866Z U c10::IValue::TagType::get(c10::IValue const&) 2025-05-07T20:03:55.1634981Z U c10::IValue::hash(c10::IValue const&) 2025-05-07T20:03:55.1635091Z U c10::IValue::isBoolList() const 2025-05-07T20:03:55.1635213Z U c10::IValue::isIntList() const 2025-05-07T20:03:55.1635328Z U c10::IValue::isSymIntList() const 2025-05-07T20:03:55.1635442Z U c10::IValue::isTensorList() const 2025-05-07T20:03:55.1635597Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:55.1635698Z U c10::IntType::get() 2025-05-07T20:03:55.1635904Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:55.1636036Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:55.1636164Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:55.1636289Z U c10::OperatorHandle::~OperatorHandle() 2025-05-07T20:03:55.1636524Z U c10::OptionalType::get(c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.1636805Z U c10::SmallVectorBase::grow_pod(void const*, unsigned long, unsigned long) 2025-05-07T20:03:55.1637063Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:55.1637182Z U c10::StringType::get() 2025-05-07T20:03:55.1637328Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:55.1637497Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:55.1637669Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:03:55.1637830Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:55.1638011Z U c10::SymFloat::operator/(c10::SymFloat const&) const 2025-05-07T20:03:55.1638428Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:55.1638602Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:55.1638736Z U c10::SymInt::operator c10::SymFloat() const 2025-05-07T20:03:55.1638879Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:55.1639004Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:55.1639137Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:55.1639267Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:55.1639516Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:55.1639624Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:55.1639720Z U c10::SymIntType::get() 2025-05-07T20:03:55.1639878Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:55.1639991Z U c10::TensorImpl::requires_grad() const 2025-05-07T20:03:55.1640135Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:55.1640233Z U c10::TensorType::get() 2025-05-07T20:03:55.1640346Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:55.1641026Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:55.1641164Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:55.1641275Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:55.1641386Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:55.1641502Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:55.1641616Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:55.1641721Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:55.1641966Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:55.1642060Z U c10::cuda::device_count() 2025-05-07T20:03:55.1642186Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:55.1642311Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:55.1642455Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:55.1642585Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:55.1642752Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:55.1642866Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:55.1643276Z U c10::detail::ListImpl::ListImpl(std::vector >, c10::Type::SingletonOrSharedTypePtr) 2025-05-07T20:03:55.1643774Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:55.1644010Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:55.1644500Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.1644850Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:55.1645402Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.1645534Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:55.1645649Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:55.1645952Z U c10::impl::OperatorEntry::assertSignatureIsCorrect(c10::impl::CppSignature const&, bool) const 2025-05-07T20:03:55.1646122Z U c10::impl::OperatorEntry::reportError(c10::DispatchKey) const 2025-05-07T20:03:55.1646272Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:55.1646424Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:55.1646536Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:55.1646656Z U c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:55.1646805Z U c10::initializeFunctionalityOffsetsAndMasks() 2025-05-07T20:03:55.1647152Z U c10::ivalue::ConstantString::create(std::__cxx11::basic_string, std::allocator >) 2025-05-07T20:03:55.1647277Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:55.1647414Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:55.1647538Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:55.1647696Z U c10::operator<<(std::ostream&, c10::OperatorName const&) 2025-05-07T20:03:55.1647834Z U c10::operator<<(std::ostream&, c10::SymFloat const&) 2025-05-07T20:03:55.1647966Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:55.1648079Z U c10::throwNullDataPtrError() 2025-05-07T20:03:55.1648178Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:55.1648283Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:55.1648462Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:55.1648585Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:55.1648713Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:55.1648833Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.1648969Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:55.1649081Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:55.1649199Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:55.1649314Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:55.1649423Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:55.1649537Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.1649652Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:55.1649811Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:55.1649924Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:55.1650034Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:55.1650158Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:55.1650264Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:55.1650381Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.1650509Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:55.1652605Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:03:55.1652929Z U fbgemm_gpu::config::is_feature_enabled(fbgemm_gpu::config::FeatureGateName const&) 2025-05-07T20:03:55.1653062Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.1653210Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.1653304Z U free@GLIBC_2.2.5 2025-05-07T20:03:55.1653422Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.1653556Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.1653727Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:55.1653852Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.1653990Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.1654082Z U memcmp@GLIBC_2.2.5 2025-05-07T20:03:55.1654176Z U memcpy@GLIBC_2.14 2025-05-07T20:03:55.1654271Z U memmove@GLIBC_2.2.5 2025-05-07T20:03:55.1654356Z U memset@GLIBC_2.2.5 2025-05-07T20:03:55.1654472Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:55.1654588Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:55.1654895Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:55.1655213Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:55.1655300Z U realloc@GLIBC_2.2.5 2025-05-07T20:03:55.1655503Z U std::_Hash_bytes(void const*, unsigned long, unsigned long)@CXXABI_1.3.5 2025-05-07T20:03:55.1655834Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:55.1656207Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.1656583Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.1657098Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.1657459Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:55.1657871Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.1658298Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:55.1658783Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.1659129Z U std::__cxx11::basic_string, std::allocator >::reserve(unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.1659675Z U std::__cxx11::basic_string, std::allocator >::swap(std::__cxx11::basic_string, std::allocator >&)@GLIBCXX_3.4.21 2025-05-07T20:03:55.1660011Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:55.1660381Z U std::__cxx11::basic_stringstream, std::allocator >::~basic_stringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.1660750Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:55.1660874Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:55.1660980Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:55.1661113Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.1661246Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.1661415Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:55.1661544Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:55.1661676Z U std::bad_weak_ptr::~bad_weak_ptr()@GLIBCXX_3.4.15 2025-05-07T20:03:55.1661912Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:55.1662464Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.1662601Z U std::exception::~exception()@GLIBCXX_3.4 2025-05-07T20:03:55.1662713Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:55.1662824Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:55.1662939Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:55.1663057Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:55.1663231Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.1663454Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.1663581Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:55.1663733Z U std::out_of_range::out_of_range(char const*)@GLIBCXX_3.4.21 2025-05-07T20:03:55.1663861Z U std::out_of_range::~out_of_range()@GLIBCXX_3.4 2025-05-07T20:03:55.1664273Z U std::runtime_error::runtime_error(std::__cxx11::basic_string, std::allocator > const&)@GLIBCXX_3.4.21 2025-05-07T20:03:55.1664400Z U std::runtime_error::~runtime_error()@GLIBCXX_3.4 2025-05-07T20:03:55.1664499Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:55.1664597Z U strcmp@GLIBC_2.2.5 2025-05-07T20:03:55.1664683Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:55.1664802Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:55.1665402Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:55.1665839Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.1666081Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.1666203Z U torch::autograd::AnomalyMode::_enabled 2025-05-07T20:03:55.1666476Z U torch::autograd::AutogradContext::AutogradContext(torch::dynamo::autograd::PackedArgs&) 2025-05-07T20:03:55.1666670Z U torch::autograd::AutogradContext::get_and_bump_dirty() const 2025-05-07T20:03:55.1666875Z U torch::autograd::AutogradContext::get_non_differentiable() const 2025-05-07T20:03:55.1667069Z U torch::autograd::AutogradContext::get_saved_variables() const 2025-05-07T20:03:55.1667396Z U torch::autograd::AutogradContext::save_for_backward(std::vector >) 2025-05-07T20:03:55.1667551Z U torch::autograd::AutogradContext::save_variables() 2025-05-07T20:03:55.1667747Z U torch::autograd::ForwardADLevel::try_get_by_idx(unsigned long) 2025-05-07T20:03:55.1667912Z U torch::autograd::InputMetadata::shape_as_dim_vector() const 2025-05-07T20:03:55.1668036Z U torch::autograd::Node::assign_parent() 2025-05-07T20:03:55.1668141Z U torch::autograd::Node::metadata() 2025-05-07T20:03:55.1668266Z U torch::autograd::Node::name[abi:cxx11]() const 2025-05-07T20:03:55.1668507Z U torch::autograd::SavedVariable::SavedVariable(at::Tensor const&, bool, bool) 2025-05-07T20:03:55.1668761Z U torch::autograd::SavedVariable::unpack(std::shared_ptr) const 2025-05-07T20:03:55.1668891Z U torch::autograd::VariableInfo::VariableInfo() 2025-05-07T20:03:55.1669101Z U torch::autograd::VariableInfo::VariableInfo(at::Tensor const&, bool) 2025-05-07T20:03:55.1669308Z U torch::autograd::VariableInfo::zeros(c10::OptionalDeviceGuard&) const 2025-05-07T20:03:55.1671859Z U torch::autograd::_wrap_outputs(std::vector > const&, std::unordered_set, std::equal_to, std::allocator > const&, std::unordered_set, std::equal_to, std::allocator > const&, c10::ArrayRef >, std::shared_ptr const&, std::function > (std::vector >, std::vector >)> const&, std::unordered_set, std::equal_to, std::allocator > const&, std::function const&) 2025-05-07T20:03:55.1672006Z U torch::autograd::deleteNode(torch::autograd::Node*) 2025-05-07T20:03:55.1672157Z U torch::autograd::get_current_graph_task_exec_info() 2025-05-07T20:03:55.1672487Z U torch::autograd::impl::gradient_edge(at::Tensor const&) 2025-05-07T20:03:55.1673575Z U torch::autograd::profiler::record_function_enter_new(std::__cxx11::basic_string, std::allocator > const&, std::optional, std::allocator > > const&) 2025-05-07T20:03:55.1673750Z U torch::dynamo::autograd::getPyCompilerInterface() 2025-05-07T20:03:55.1674167Z U torch::dynamo::autograd::get_input_metadata(std::vector > const&) 2025-05-07T20:03:55.1674567Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:55.1675153Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:55.1675260Z U typeinfo for c10::Error 2025-05-07T20:03:55.1675415Z U typeinfo for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:55.1675542Z U typeinfo for std::exception@GLIBCXX_3.4 2025-05-07T20:03:55.1675700Z U typeinfo for std::out_of_range@GLIBCXX_3.4 2025-05-07T20:03:55.1675832Z U typeinfo for std::runtime_error@GLIBCXX_3.4 2025-05-07T20:03:55.1675964Z U typeinfo for torch::autograd::Node 2025-05-07T20:03:55.1677504Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:55.1678980Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:55.1680373Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:55.1681786Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:55.1683372Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:55.1684658Z U void embedding_ops::grad_mean_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, fbgemm_gpu::FixedDivisor) 2025-05-07T20:03:55.1684802Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:55.1684961Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:55.1685107Z U vtable for __cxxabiv1::__pointer_type_info@CXXABI_1.3 2025-05-07T20:03:55.1685256Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:55.1685359Z U vtable for c10::Error 2025-05-07T20:03:55.1685676Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.1685821Z U vtable for std::bad_weak_ptr@GLIBCXX_3.4.15 2025-05-07T20:03:55.1686043Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:55.1686151Z U vtable for torch::autograd::Node 2025-05-07T20:03:55.1686318Z w TLS init function for c10::impl::raw_local_dispatch_key_set 2025-05-07T20:03:55.1686429Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:55.1686528Z w _ITM_registerTMCloneTable 2025-05-07T20:03:55.1686627Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:55.1686723Z w __gmon_start__ 2025-05-07T20:03:55.1686831Z w __pthread_key_create 2025-05-07T20:03:55.1686934Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:55.1687035Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:55.1687196Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:55.1687447Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:55.1687455Z 2025-05-07T20:03:55.1687592Z linux-vdso.so.1 (0x00007ffdec31a000) 2025-05-07T20:03:55.1687673Z libc10.so => not found 2025-05-07T20:03:55.1687781Z libc10_cuda.so => not found 2025-05-07T20:03:55.1688322Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007fbcce200000) 2025-05-07T20:03:55.1688418Z libtorch.so => not found 2025-05-07T20:03:55.1688504Z libtorch_cpu.so => not found 2025-05-07T20:03:55.1688592Z libtorch_cuda.so => not found 2025-05-07T20:03:55.1688692Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.1688846Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007fbccdf9c000) 2025-05-07T20:03:55.1688988Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007fbcf050a000) 2025-05-07T20:03:55.1689139Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007fbcf04dc000) 2025-05-07T20:03:55.1689258Z libc.so.6 => /lib64/libc.so.6 (0x00007fbccdd94000) 2025-05-07T20:03:55.1689377Z /lib64/ld-linux-x86-64.so.2 (0x00007fbcf0566000) 2025-05-07T20:03:55.1689455Z libc10.so => not found 2025-05-07T20:03:55.1689548Z libc10_cuda.so => not found 2025-05-07T20:03:55.1689997Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007fbccdb9e000) 2025-05-07T20:03:55.1690513Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007fbccdaa6000) 2025-05-07T20:03:55.1690604Z libtorch.so => not found 2025-05-07T20:03:55.1690942Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007fbccd400000) 2025-05-07T20:03:55.1691371Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007fbccca00000) 2025-05-07T20:03:55.1691469Z libtorch_cpu.so => not found 2025-05-07T20:03:55.1691556Z libtorch_cuda.so => not found 2025-05-07T20:03:55.1691647Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.1691763Z libm.so.6 => /lib64/libm.so.6 (0x00007fbccd9cb000) 2025-05-07T20:03:55.1691848Z libc10.so => not found 2025-05-07T20:03:55.1691933Z libc10_cuda.so => not found 2025-05-07T20:03:55.1692347Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007fbcf04cb000) 2025-05-07T20:03:55.1692439Z libtorch.so => not found 2025-05-07T20:03:55.1692523Z libtorch_cpu.so => not found 2025-05-07T20:03:55.1692607Z libtorch_cuda.so => not found 2025-05-07T20:03:55.1692703Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.1692778Z libc10.so => not found 2025-05-07T20:03:55.1692861Z libc10_cuda.so => not found 2025-05-07T20:03:55.1692943Z libtorch.so => not found 2025-05-07T20:03:55.1693040Z libtorch_cpu.so => not found 2025-05-07T20:03:55.1693127Z libtorch_cuda.so => not found 2025-05-07T20:03:55.1693235Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.1693317Z libc10.so => not found 2025-05-07T20:03:55.1693649Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007fbcf044f000) 2025-05-07T20:03:55.1693736Z libtorch.so => not found 2025-05-07T20:03:55.1693821Z libtorch_cpu.so => not found 2025-05-07T20:03:55.1693914Z libtorch_cuda.so => not found 2025-05-07T20:03:55.1693999Z libtorch.so => not found 2025-05-07T20:03:55.1694080Z libc10.so => not found 2025-05-07T20:03:55.1694167Z libc10_cuda.so => not found 2025-05-07T20:03:55.1694253Z libtorch_cpu.so => not found 2025-05-07T20:03:55.1694364Z libtorch_cuda.so => not found 2025-05-07T20:03:55.1694454Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.1694544Z libtorch.so => not found 2025-05-07T20:03:55.1694622Z libc10.so => not found 2025-05-07T20:03:55.1694705Z libtorch_cpu.so => not found 2025-05-07T20:03:55.1694820Z libtorch_cuda.so => not found 2025-05-07T20:03:55.1694986Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007fbcf0444000) 2025-05-07T20:03:55.1695071Z libtorch_cpu.so => not found 2025-05-07T20:03:55.1695160Z libtorch_cuda.so => not found 2025-05-07T20:03:55.1695254Z libtorch.so => not found 2025-05-07T20:03:55.1695413Z librt.so.1 => /lib64/librt.so.1 (0x00007fbcf043d000) 2025-05-07T20:03:55.1695419Z 2025-05-07T20:03:55.1695513Z [CHECK] Displaying ELF information: 2025-05-07T20:03:55.1695803Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_dense.so 2025-05-07T20:03:55.1695807Z 2025-05-07T20:03:55.1710895Z 2025-05-07T20:03:55.1711687Z Dynamic section at offset 0x1e278a8 contains 39 entries: 2025-05-07T20:03:55.1712106Z Tag Type Name/Value 2025-05-07T20:03:55.1713091Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:55.1713329Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:55.1713614Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:55.1713815Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:55.1714018Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:55.1714239Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:55.1714448Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:55.1714646Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:55.1714839Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:55.1715043Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:55.1715234Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:55.1715451Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:55.1715745Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_dense.so] 2025-05-07T20:03:55.1715928Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:55.1716039Z 0x000000000000000c (INIT) 0x79000 2025-05-07T20:03:55.1716165Z 0x000000000000000d (FINI) 0x25a06c 2025-05-07T20:03:55.1716289Z 0x0000000000000019 (INIT_ARRAY) 0x1e260e0 2025-05-07T20:03:55.1716413Z 0x000000000000001b (INIT_ARRAYSZ) 184 (bytes) 2025-05-07T20:03:55.1716529Z 0x000000000000001a (FINI_ARRAY) 0x1e26198 2025-05-07T20:03:55.1716657Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:55.1716771Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:55.1716880Z 0x0000000000000005 (STRTAB) 0xe130 2025-05-07T20:03:55.1717000Z 0x0000000000000006 (SYMTAB) 0x3300 2025-05-07T20:03:55.1717136Z 0x000000000000000a (STRSZ) 373406 (bytes) 2025-05-07T20:03:55.1718658Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:55.1718816Z 0x0000000000000003 (PLTGOT) 0x1e27b58 2025-05-07T20:03:55.1718953Z 0x0000000000000002 (PLTRELSZ) 18480 (bytes) 2025-05-07T20:03:55.1719066Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:55.1719180Z 0x0000000000000017 (JMPREL) 0x73f80 2025-05-07T20:03:55.1719293Z 0x0000000000000007 (RELA) 0x6a398 2025-05-07T20:03:55.1719424Z 0x0000000000000008 (RELASZ) 39912 (bytes) 2025-05-07T20:03:55.1719541Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:55.1719641Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:55.1719803Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:55.1719914Z 0x000000006ffffffe (VERNEED) 0x6a258 2025-05-07T20:03:55.1720027Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:55.1720174Z 0x000000006ffffff0 (VERSYM) 0x693ce 2025-05-07T20:03:55.1720282Z 0x000000006ffffff9 (RELACOUNT) 270 2025-05-07T20:03:55.1720379Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:55.1720384Z 2025-05-07T20:03:55.1720506Z ################################################################################ 2025-05-07T20:03:55.1720544Z 2025-05-07T20:03:55.1720549Z 2025-05-07T20:03:55.1720658Z ################################################################################ 2025-05-07T20:03:55.1720989Z [CHECK] BUILT LIBRARY: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:55.1721099Z [CHECK] Listing out library size: 2025-05-07T20:03:55.1721417Z + du -h --block-size=1M ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:55.1721422Z 2025-05-07T20:03:55.1725140Z 175 ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:55.1725166Z 2025-05-07T20:03:55.1726113Z [CHECK] Listing out the GLIBC versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:55.1726662Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBC_ | sed 's/.*GLIBC_\([.0-9]*\).*/GLIBC_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.2363475Z 2025-05-07T20:03:55.2363896Z GLIBC_2.2.5 2025-05-07T20:03:55.2364507Z GLIBC_2.3 2025-05-07T20:03:55.2365045Z GLIBC_2.14 2025-05-07T20:03:55.2365356Z 2025-05-07T20:03:55.2365379Z 2025-05-07T20:03:55.2366773Z [CHECK] Listing out the GLIBCXX versions referenced by: ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:55.2370152Z + objdump -TC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so | grep GLIBCXX_ | sed 's/.*GLIBCXX_\([.0-9]*\).*/GLIBCXX_\1/g' | sort -Vu | cat 2025-05-07T20:03:55.2372149Z 2025-05-07T20:03:55.3003487Z GLIBCXX_3.4 2025-05-07T20:03:55.3004156Z GLIBCXX_3.4.9 2025-05-07T20:03:55.3004739Z GLIBCXX_3.4.11 2025-05-07T20:03:55.3005297Z GLIBCXX_3.4.18 2025-05-07T20:03:55.3005870Z GLIBCXX_3.4.20 2025-05-07T20:03:55.3006423Z GLIBCXX_3.4.21 2025-05-07T20:03:55.3006775Z 2025-05-07T20:03:55.3006790Z 2025-05-07T20:03:55.3027441Z + nm -gDC ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.BbUphUfts3.symbols.txt 2025-05-07T20:03:55.3029069Z 2025-05-07T20:03:55.3634145Z 2025-05-07T20:03:55.3670347Z [CHECK] Total Number of symbols: 3695 2025-05-07T20:03:55.3707468Z [CHECK] Number of fbgemm symbols: 551 2025-05-07T20:03:55.3725370Z + nm -gDCu ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so > /tmp/tmp.SOYsYeMZ4P.usymbols.txt 2025-05-07T20:03:55.3726133Z 2025-05-07T20:03:55.3756086Z 2025-05-07T20:03:55.3782620Z [CHECK] Listing out undefined symbols (183 total): 2025-05-07T20:03:55.3798895Z U VTT for std::__cxx11::basic_ostringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.3799869Z U VTT for std::__cxx11::basic_stringstream, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.3800417Z U _Unwind_Resume@GCC_3.0 2025-05-07T20:03:55.3800767Z U __cudaPopCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.3801175Z U __cudaPushCallConfiguration@libcudart.so.11.0 2025-05-07T20:03:55.3801571Z U __cudaRegisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.3801949Z U __cudaRegisterFatBinaryEnd@libcudart.so.11.0 2025-05-07T20:03:55.3802338Z U __cudaRegisterFunction@libcudart.so.11.0 2025-05-07T20:03:55.3802742Z U __cudaRegisterVar@libcudart.so.11.0 2025-05-07T20:03:55.3803110Z U __cudaUnregisterFatBinary@libcudart.so.11.0 2025-05-07T20:03:55.3803454Z U __cxa_atexit@GLIBC_2.2.5 2025-05-07T20:03:55.3803817Z U __cxa_begin_catch@CXXABI_1.3 2025-05-07T20:03:55.3804126Z U __cxa_end_catch@CXXABI_1.3 2025-05-07T20:03:55.3804433Z U __cxa_rethrow@CXXABI_1.3 2025-05-07T20:03:55.3804747Z U __cxa_thread_atexit@CXXABI_1.3.7 2025-05-07T20:03:55.3805067Z U __gxx_personality_v0@CXXABI_1.3 2025-05-07T20:03:55.3805427Z U __tls_get_addr@GLIBC_2.3 2025-05-07T20:03:55.3805731Z U adjust_info_B_num_bits(int, int) 2025-05-07T20:03:55.3806078Z U at::CUDAGeneratorImpl::device_type() 2025-05-07T20:03:55.3806539Z U at::CUDAGeneratorImpl::philox_cuda_state(unsigned long) 2025-05-07T20:03:55.3806965Z U at::Context::deterministicAlgorithms() const 2025-05-07T20:03:55.3807399Z U at::TensorBase::__dispatch_contiguous(c10::MemoryFormat) const 2025-05-07T20:03:55.3807865Z U at::_ops::copy_::call(at::Tensor&, at::Tensor const&, bool) 2025-05-07T20:03:55.3808739Z U at::_ops::empty_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.3810111Z U at::_ops::empty_memory_format::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.3811095Z U at::_ops::reshape::call(at::Tensor const&, c10::ArrayRef) 2025-05-07T20:03:55.3811705Z U at::_ops::to_dtype::call(at::Tensor const&, c10::ScalarType, bool, bool, std::optional) 2025-05-07T20:03:55.3812617Z U at::_ops::zeros::call(c10::ArrayRef, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.3814000Z U at::_ops::zeros_like::call(at::Tensor const&, std::optional, std::optional, std::optional, std::optional, std::optional) 2025-05-07T20:03:55.3814800Z U at::cuda::detail::getDefaultCUDAGenerator(signed char) 2025-05-07T20:03:55.3815188Z U at::cuda::getCurrentDeviceProperties() 2025-05-07T20:03:55.3815512Z U at::globalContext() 2025-05-07T20:03:55.3815878Z U c10::BFloat16* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.3816272Z U c10::BoolType::get() 2025-05-07T20:03:55.3816600Z U c10::DeviceTypeName[abi:cxx11](c10::DeviceType, bool) 2025-05-07T20:03:55.3816946Z U c10::FloatType::get() 2025-05-07T20:03:55.3817244Z U c10::GeneratorImpl::device() const 2025-05-07T20:03:55.3817609Z U c10::Half* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.3818008Z U c10::IValue::reportToTensorTypeError() const 2025-05-07T20:03:55.3818326Z U c10::IntType::get() 2025-05-07T20:03:55.3818691Z U c10::MessageLogger::MessageLogger(char const*, int, int) 2025-05-07T20:03:55.3819061Z U c10::MessageLogger::~MessageLogger() 2025-05-07T20:03:55.3819420Z U c10::StorageImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:55.3819801Z U c10::SymBool::expect_true(char const*, long) const 2025-05-07T20:03:55.3820159Z U c10::SymBool::guard_bool(char const*, long) const 2025-05-07T20:03:55.3820553Z U c10::SymBool::guard_size_oblivious(char const*, long) const 2025-05-07T20:03:55.3820948Z U c10::SymFloat::guard_float(char const*, long) const 2025-05-07T20:03:55.3821662Z U c10::SymInt::SymInt(c10::intrusive_ptr >) 2025-05-07T20:03:55.3822332Z U c10::SymInt::guard_int(char const*, long) const 2025-05-07T20:03:55.3822688Z U c10::SymInt::operator/(c10::SymInt const&) const 2025-05-07T20:03:55.3823032Z U c10::SymInt::promote_to_negative() 2025-05-07T20:03:55.3823356Z U c10::SymInt::sym_eq(c10::SymInt const&) const 2025-05-07T20:03:55.3823740Z U c10::SymInt::sym_gt(c10::SymInt const&) const 2025-05-07T20:03:55.3824097Z U c10::SymInt::sym_le(c10::SymInt const&) const 2025-05-07T20:03:55.3824421Z U c10::SymInt::toSymNode() const 2025-05-07T20:03:55.3824734Z U c10::SymIntType::get() 2025-05-07T20:03:55.3825074Z U c10::SymbolicShapeMeta::init_is_contiguous() const 2025-05-07T20:03:55.3825474Z U c10::TensorImpl::throw_data_ptr_access_error() const 2025-05-07T20:03:55.3825810Z U c10::TensorType::get() 2025-05-07T20:03:55.3826119Z U c10::UndefinedTensorImpl::_singleton 2025-05-07T20:03:55.3827020Z U c10::Warning::Warning(std::variant, c10::SourceLocation const&, std::__cxx11::basic_string, std::allocator >, bool) 2025-05-07T20:03:55.3827910Z U c10::cuda::CUDACachingAllocator::allocator 2025-05-07T20:03:55.3828265Z U c10::cuda::CUDAStream::stream() const 2025-05-07T20:03:55.3828585Z U c10::cuda::ExchangeDevice(signed char) 2025-05-07T20:03:55.3828908Z U c10::cuda::GetDevice(signed char*) 2025-05-07T20:03:55.3829229Z U c10::cuda::MaybeSetDevice(signed char) 2025-05-07T20:03:55.3829536Z U c10::cuda::SetDevice(signed char) 2025-05-07T20:03:55.3829983Z U c10::cuda::c10_cuda_check_implementation(int, char const*, char const*, int, bool) 2025-05-07T20:03:55.3830411Z U c10::cuda::device_count() 2025-05-07T20:03:55.3830738Z U c10::cuda::getCurrentCUDAStream(signed char) 2025-05-07T20:03:55.3831104Z U c10::cuda::getDefaultCUDAStream(signed char) 2025-05-07T20:03:55.3831888Z U c10::cuda::getStreamFromPool(bool, signed char) 2025-05-07T20:03:55.3832284Z U c10::cuda::getStreamFromPool(int, signed char) 2025-05-07T20:03:55.3832813Z U c10::cuda::setCurrentCUDAStream(c10::cuda::CUDAStream) 2025-05-07T20:03:55.3833209Z U c10::cuda::warn_or_error_on_sync() 2025-05-07T20:03:55.3833965Z U c10::detail::infer_schema::make_function_schema(c10::ArrayRef, c10::ArrayRef) 2025-05-07T20:03:55.3834862Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, char const*) 2025-05-07T20:03:55.3835762Z U c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.3836818Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, char const*) 2025-05-07T20:03:55.3837892Z U c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.3838736Z U c10::impl::GPUTrace::gpuTraceState 2025-05-07T20:03:55.3839086Z U c10::impl::GPUTrace::haveState 2025-05-07T20:03:55.3839481Z U c10::impl::cow::is_cow_data_ptr(c10::DataPtr const&) 2025-05-07T20:03:55.3839939Z U c10::impl::cow::materialize_cow_storage(c10::StorageImpl&) 2025-05-07T20:03:55.3840387Z U c10::impl::device_guard_impl_registry 2025-05-07T20:03:55.3840764Z U c10::operator+(c10::SymInt const&, int) 2025-05-07T20:03:55.3841131Z U c10::operator-(c10::SymInt const&, int) 2025-05-07T20:03:55.3841571Z U c10::operator<<(std::ostream&, c10::Device const&) 2025-05-07T20:03:55.3841978Z U c10::operator<<(std::ostream&, c10::DeviceType) 2025-05-07T20:03:55.3842397Z U c10::operator<<(std::ostream&, c10::SymInt const&) 2025-05-07T20:03:55.3842826Z U c10::throwNullDataPtrError() 2025-05-07T20:03:55.3843161Z U c10::warn(c10::Warning const&) 2025-05-07T20:03:55.3843497Z U c10::warnDeprecatedDataPtr() 2025-05-07T20:03:55.3843906Z U caffe2::TypeMeta::error_unsupported_typemeta(caffe2::TypeMeta) 2025-05-07T20:03:55.3844356Z U caffe2::TypeMeta::typeMetaDatas() 2025-05-07T20:03:55.3844736Z U cudaDeviceGetAttribute@libcudart.so.11.0 2025-05-07T20:03:55.3845338Z U cudaDeviceSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.3845701Z U cudaEventCreateWithFlags@libcudart.so.11.0 2025-05-07T20:03:55.3846046Z U cudaEventDestroy@libcudart.so.11.0 2025-05-07T20:03:55.3846387Z U cudaEventElapsedTime@libcudart.so.11.0 2025-05-07T20:03:55.3846717Z U cudaEventQuery@libcudart.so.11.0 2025-05-07T20:03:55.3847041Z U cudaEventRecord@libcudart.so.11.0 2025-05-07T20:03:55.3847371Z U cudaEventSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.3847720Z U cudaFuncSetAttribute@libcudart.so.11.0 2025-05-07T20:03:55.3848082Z U cudaGetDeviceProperties@libcudart.so.11.0 2025-05-07T20:03:55.3848425Z U cudaGetErrorString@libcudart.so.11.0 2025-05-07T20:03:55.3848764Z U cudaGetLastError@libcudart.so.11.0 2025-05-07T20:03:55.3849087Z U cudaLaunchKernel@libcudart.so.11.0 2025-05-07T20:03:55.3849419Z U cudaStreamQuery@libcudart.so.11.0 2025-05-07T20:03:55.3849749Z U cudaStreamSynchronize@libcudart.so.11.0 2025-05-07T20:03:55.3850104Z U cudaStreamWaitEvent@libcudart.so.11.0 2025-05-07T20:03:55.3852437Z U embedding_ops::split_embedding_backward_codegen_find_long_segments(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, int, bool) 2025-05-07T20:03:55.3854795Z U fbgemm_gpu::asynchronous_complete_cumsum_gpu(at::Tensor const&) 2025-05-07T20:03:55.3855396Z U float at::Tensor::item() const 2025-05-07T20:03:55.3855769Z U float* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.3856168Z U float* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.3856605Z U int* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.3856989Z U int* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.3857409Z U long c10::detail::maybe_wrap_dim_slow(long, long, bool) 2025-05-07T20:03:55.3857839Z U long* at::TensorBase::data_ptr() const 2025-05-07T20:03:55.3858416Z U long* at::TensorBase::mutable_data_ptr() const 2025-05-07T20:03:55.3858790Z U memcpy@GLIBC_2.14 2025-05-07T20:03:55.3859079Z U memset@GLIBC_2.2.5 2025-05-07T20:03:55.3859402Z U operator delete(void*)@GLIBCXX_3.4 2025-05-07T20:03:55.3859790Z U operator new(unsigned long)@GLIBCXX_3.4 2025-05-07T20:03:55.3860357Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:55.3861166Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:55.3861911Z U radix_sort_pairs(void*, unsigned long&, int const*, int*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:55.3862723Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, float const*, float*, int, int, int, CUstream_st*) 2025-05-07T20:03:55.3863507Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, int const*, int*, int, int, int, CUstream_st*) 2025-05-07T20:03:55.3864265Z U radix_sort_pairs(void*, unsigned long&, long const*, long*, long const*, long*, int, int, int, CUstream_st*) 2025-05-07T20:03:55.3865059Z U std::__cxx11::basic_ostringstream, std::allocator >::basic_ostringstream() 2025-05-07T20:03:55.3865936Z U std::__cxx11::basic_ostringstream, std::allocator >::~basic_ostringstream()@GLIBCXX_3.4.21 2025-05-07T20:03:55.3866853Z U std::__cxx11::basic_string, std::allocator >::_M_append(char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.3867934Z U std::__cxx11::basic_string, std::allocator >::_M_assign(std::__cxx11::basic_string, std::allocator > const&) 2025-05-07T20:03:55.3868994Z U std::__cxx11::basic_string, std::allocator >::_M_construct(unsigned long, char)@GLIBCXX_3.4.21 2025-05-07T20:03:55.3869918Z U std::__cxx11::basic_string, std::allocator >::_M_create(unsigned long&, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.3870916Z U std::__cxx11::basic_string, std::allocator >::_M_mutate(unsigned long, unsigned long, char const*, unsigned long) 2025-05-07T20:03:55.3872024Z U std::__cxx11::basic_string, std::allocator >::_M_replace(unsigned long, unsigned long, char const*, unsigned long)@GLIBCXX_3.4.21 2025-05-07T20:03:55.3873091Z U std::__cxx11::basic_stringstream, std::allocator >::basic_stringstream() 2025-05-07T20:03:55.3873937Z U std::__detail::_Prime_rehash_policy::_M_need_rehash(unsigned long, unsigned long, unsigned long) const@GLIBCXX_3.4.18 2025-05-07T20:03:55.3874556Z U std::__throw_bad_alloc()@GLIBCXX_3.4 2025-05-07T20:03:55.3874899Z U std::__throw_bad_array_new_length() 2025-05-07T20:03:55.3875279Z U std::__throw_length_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.3875681Z U std::__throw_logic_error(char const*)@GLIBCXX_3.4 2025-05-07T20:03:55.3876122Z U std::__throw_out_of_range_fmt(char const*, ...)@GLIBCXX_3.4.20 2025-05-07T20:03:55.3876566Z U std::__throw_system_error(int)@GLIBCXX_3.4.11 2025-05-07T20:03:55.3877082Z U std::basic_ios >::clear(std::_Ios_Iostate)@GLIBCXX_3.4 2025-05-07T20:03:55.3878042Z U std::basic_ostream >& std::__ostream_insert >(std::basic_ostream >&, char const*, long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.3878862Z U std::ios_base::Init::Init()@GLIBCXX_3.4 2025-05-07T20:03:55.3879228Z U std::ios_base::Init::~Init()@GLIBCXX_3.4 2025-05-07T20:03:55.3879581Z U std::ios_base::~ios_base()@GLIBCXX_3.4 2025-05-07T20:03:55.3879946Z U std::locale::~locale()@GLIBCXX_3.4 2025-05-07T20:03:55.3880344Z U std::ostream& std::ostream::_M_insert(long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.3880890Z U std::ostream& std::ostream::_M_insert(unsigned long)@GLIBCXX_3.4.9 2025-05-07T20:03:55.3881392Z U std::ostream::operator<<(int)@GLIBCXX_3.4 2025-05-07T20:03:55.3881739Z U std::terminate()@GLIBCXX_3.4 2025-05-07T20:03:55.3882054Z U strlen@GLIBC_2.2.5 2025-05-07T20:03:55.3882367Z U torch::CppFunction::~CppFunction() 2025-05-07T20:03:55.3883228Z U torch::Library::Library(torch::Library::Kind, std::__cxx11::basic_string, std::allocator >, std::optional, char const*, unsigned int) 2025-05-07T20:03:55.3884411Z U torch::Library::_def(c10::FunctionSchema&&, c10::OperatorName*, std::vector > const&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.3885360Z U torch::Library::_impl(char const*, torch::CppFunction&&, torch::_RegisterOrVerify) & 2025-05-07T20:03:55.3886046Z U torch::jit::parseSchema(std::__cxx11::basic_string, std::allocator > const&, bool) 2025-05-07T20:03:55.3887011Z U transpose_embedding_input(at::Tensor, long, at::Tensor, at::Tensor, bool, std::optional const&, long, long, long, bool, std::optional const&, long, long) 2025-05-07T20:03:55.3889489Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:55.3893326Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:55.3897080Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:55.3900820Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:55.3905114Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:55.3909190Z U void embedding_ops::grad_mean_vbe_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int, unsigned int) 2025-05-07T20:03:55.3913247Z U void embedding_ops::split_embedding_backward_count_unique_indices_kernel(at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, at::GenericPackedTensorAccessor, int) 2025-05-07T20:03:55.3915263Z U vtable for __cxxabiv1::__class_type_info@CXXABI_1.3 2025-05-07T20:03:55.3915695Z U vtable for __cxxabiv1::__function_type_info@CXXABI_1.3 2025-05-07T20:03:55.3916123Z U vtable for __cxxabiv1::__si_class_type_info@CXXABI_1.3 2025-05-07T20:03:55.3916723Z U vtable for std::__cxx11::basic_stringbuf, std::allocator >@GLIBCXX_3.4.21 2025-05-07T20:03:55.3917397Z U vtable for std::basic_streambuf >@GLIBCXX_3.4 2025-05-07T20:03:55.3917839Z w _ITM_deregisterTMCloneTable 2025-05-07T20:03:55.3918165Z w _ITM_registerTMCloneTable 2025-05-07T20:03:55.3918464Z w __cxa_finalize@GLIBC_2.2.5 2025-05-07T20:03:55.3918757Z w __gmon_start__ 2025-05-07T20:03:55.3919024Z w __pthread_key_create 2025-05-07T20:03:55.3919322Z w pthread_mutex_lock@GLIBC_2.2.5 2025-05-07T20:03:55.3919652Z w pthread_mutex_unlock@GLIBC_2.2.5 2025-05-07T20:03:55.3920014Z [CHECK] Listing out external shared libraries linked: 2025-05-07T20:03:55.3920522Z + ldd ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:55.3920897Z 2025-05-07T20:03:55.3921053Z linux-vdso.so.1 (0x00007ffe89fd4000) 2025-05-07T20:03:55.3921332Z libc10.so => not found 2025-05-07T20:03:55.3921574Z libc10_cuda.so => not found 2025-05-07T20:03:55.3922314Z fbgemm_gpu_tbe_training_backward.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so (0x00007f5494c00000) 2025-05-07T20:03:55.3923088Z libtorch.so => not found 2025-05-07T20:03:55.3923341Z libtorch_cpu.so => not found 2025-05-07T20:03:55.3923601Z libtorch_cuda.so => not found 2025-05-07T20:03:55.3923868Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.3924192Z libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f549499c000) 2025-05-07T20:03:55.3924638Z libgomp.so.1 => /lib64/libgomp.so.1 (0x00007f54c04e9000) 2025-05-07T20:03:55.3925035Z libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f54c04bb000) 2025-05-07T20:03:55.3925514Z libc.so.6 => /lib64/libc.so.6 (0x00007f5494794000) 2025-05-07T20:03:55.3925837Z /lib64/ld-linux-x86-64.so.2 (0x00007f54c0545000) 2025-05-07T20:03:55.3926130Z libc10.so => not found 2025-05-07T20:03:55.3926353Z libc10_cuda.so => not found 2025-05-07T20:03:55.3926945Z fbgemm_gpu_tbe_common.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so (0x00007f549459e000) 2025-05-07T20:03:55.3927995Z fbgemm_gpu_sparse_async_cumsum.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so (0x00007f54944a6000) 2025-05-07T20:03:55.3928715Z libtorch.so => not found 2025-05-07T20:03:55.3929193Z fbgemm.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so (0x00007f5493e00000) 2025-05-07T20:03:55.3930080Z fbgemm_gpu_tbe_utils.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so (0x00007f5493400000) 2025-05-07T20:03:55.3930692Z libtorch_cpu.so => not found 2025-05-07T20:03:55.3930944Z libtorch_cuda.so => not found 2025-05-07T20:03:55.3931381Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.3931849Z libm.so.6 => /lib64/libm.so.6 (0x00007f54943cb000) 2025-05-07T20:03:55.3932165Z libc10.so => not found 2025-05-07T20:03:55.3932411Z libc10_cuda.so => not found 2025-05-07T20:03:55.3933022Z fbgemm_gpu_config.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so (0x00007f54c04aa000) 2025-05-07T20:03:55.3933674Z libtorch.so => not found 2025-05-07T20:03:55.3933922Z libtorch_cpu.so => not found 2025-05-07T20:03:55.3934181Z libtorch_cuda.so => not found 2025-05-07T20:03:55.3934447Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.3934698Z libc10.so => not found 2025-05-07T20:03:55.3934936Z libc10_cuda.so => not found 2025-05-07T20:03:55.3935185Z libtorch.so => not found 2025-05-07T20:03:55.3935445Z libtorch_cpu.so => not found 2025-05-07T20:03:55.3935700Z libtorch_cuda.so => not found 2025-05-07T20:03:55.3935962Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.3936210Z libc10.so => not found 2025-05-07T20:03:55.3936714Z asmjit.so => /__w/FBGEMM/FBGEMM/fbgemm_gpu/./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so (0x00007f54b4d88000) 2025-05-07T20:03:55.3937277Z libtorch.so => not found 2025-05-07T20:03:55.3937519Z libtorch_cpu.so => not found 2025-05-07T20:03:55.3937779Z libtorch_cuda.so => not found 2025-05-07T20:03:55.3938028Z libtorch.so => not found 2025-05-07T20:03:55.3938266Z libc10.so => not found 2025-05-07T20:03:55.3938490Z libc10_cuda.so => not found 2025-05-07T20:03:55.3938752Z libtorch_cpu.so => not found 2025-05-07T20:03:55.3939006Z libtorch_cuda.so => not found 2025-05-07T20:03:55.3939277Z libcudart.so.11.0 => not found 2025-05-07T20:03:55.3939536Z libtorch.so => not found 2025-05-07T20:03:55.3939770Z libc10.so => not found 2025-05-07T20:03:55.3940006Z libtorch_cpu.so => not found 2025-05-07T20:03:55.3940259Z libtorch_cuda.so => not found 2025-05-07T20:03:55.3940608Z libpthread.so.0 => /lib64/libpthread.so.0 (0x00007f54c049b000) 2025-05-07T20:03:55.3940981Z libtorch_cpu.so => not found 2025-05-07T20:03:55.3941242Z libtorch_cuda.so => not found 2025-05-07T20:03:55.3941490Z libtorch.so => not found 2025-05-07T20:03:55.3941773Z librt.so.1 => /lib64/librt.so.1 (0x00007f54c0494000) 2025-05-07T20:03:55.3942008Z 2025-05-07T20:03:55.3942113Z [CHECK] Displaying ELF information: 2025-05-07T20:03:55.3942603Z + readelf -d ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_vbe.so 2025-05-07T20:03:55.3943000Z 2025-05-07T20:03:55.3943005Z 2025-05-07T20:03:55.3943166Z Dynamic section at offset 0xaed9e48 contains 39 entries: 2025-05-07T20:03:55.3943541Z Tag Type Name/Value 2025-05-07T20:03:55.3943949Z 0x0000000000000001 (NEEDED) Shared library: [libc10.so] 2025-05-07T20:03:55.3944608Z 0x0000000000000001 (NEEDED) Shared library: [libc10_cuda.so] 2025-05-07T20:03:55.3945141Z 0x0000000000000001 (NEEDED) Shared library: [fbgemm_gpu_tbe_training_backward.so] 2025-05-07T20:03:55.3945679Z 0x0000000000000001 (NEEDED) Shared library: [libtorch.so] 2025-05-07T20:03:55.3946147Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cpu.so] 2025-05-07T20:03:55.3946638Z 0x0000000000000001 (NEEDED) Shared library: [libtorch_cuda.so] 2025-05-07T20:03:55.3947120Z 0x0000000000000001 (NEEDED) Shared library: [libcudart.so.11.0] 2025-05-07T20:03:55.3947640Z 0x0000000000000001 (NEEDED) Shared library: [libstdc++.so.6] 2025-05-07T20:03:55.3948105Z 0x0000000000000001 (NEEDED) Shared library: [libgomp.so.1] 2025-05-07T20:03:55.3948571Z 0x0000000000000001 (NEEDED) Shared library: [libgcc_s.so.1] 2025-05-07T20:03:55.3949066Z 0x0000000000000001 (NEEDED) Shared library: [libc.so.6] 2025-05-07T20:03:55.3949534Z 0x0000000000000001 (NEEDED) Shared library: [ld-linux-x86-64.so.2] 2025-05-07T20:03:55.3950119Z 0x000000000000000e (SONAME) Library soname: [fbgemm_gpu_tbe_training_backward_vbe.so] 2025-05-07T20:03:55.3950641Z 0x000000000000000f (RPATH) Library rpath: [$ORIGIN] 2025-05-07T20:03:55.3951021Z 0x000000000000000c (INIT) 0x1ad000 2025-05-07T20:03:55.3951323Z 0x000000000000000d (FINI) 0xe4d99c 2025-05-07T20:03:55.3951641Z 0x0000000000000019 (INIT_ARRAY) 0xaed55e8 2025-05-07T20:03:55.3951973Z 0x000000000000001b (INIT_ARRAYSZ) 680 (bytes) 2025-05-07T20:03:55.3952293Z 0x000000000000001a (FINI_ARRAY) 0xaed5890 2025-05-07T20:03:55.3952680Z 0x000000000000001c (FINI_ARRAYSZ) 8 (bytes) 2025-05-07T20:03:55.3953002Z 0x000000006ffffef5 (GNU_HASH) 0x238 2025-05-07T20:03:55.3953509Z 0x0000000000000005 (STRTAB) 0x1b3a0 2025-05-07T20:03:55.3953827Z 0x0000000000000006 (SYMTAB) 0x5920 2025-05-07T20:03:55.3954182Z 0x000000000000000a (STRSZ) 1481806 (bytes) 2025-05-07T20:03:55.3954549Z 0x000000000000000b (SYMENT) 24 (bytes) 2025-05-07T20:03:55.3954884Z 0x0000000000000003 (PLTGOT) 0xaedb0f8 2025-05-07T20:03:55.3955241Z 0x0000000000000002 (PLTRELSZ) 22176 (bytes) 2025-05-07T20:03:55.3955572Z 0x0000000000000014 (PLTREL) RELA 2025-05-07T20:03:55.3955895Z 0x0000000000000017 (JMPREL) 0x1a6bf0 2025-05-07T20:03:55.3956219Z 0x0000000000000007 (RELA) 0x186df0 2025-05-07T20:03:55.3956571Z 0x0000000000000008 (RELASZ) 130560 (bytes) 2025-05-07T20:03:55.3956918Z 0x0000000000000009 (RELAENT) 24 (bytes) 2025-05-07T20:03:55.3957238Z 0x0000000000000018 (BIND_NOW) 2025-05-07T20:03:55.3957570Z 0x000000006ffffffb (FLAGS_1) Flags: NOW 2025-05-07T20:03:55.3957913Z 0x000000006ffffffe (VERNEED) 0x186cd0 2025-05-07T20:03:55.3958245Z 0x000000006fffffff (VERNEEDNUM) 5 2025-05-07T20:03:55.3958563Z 0x000000006ffffff0 (VERSYM) 0x184fee 2025-05-07T20:03:55.3958895Z 0x000000006ffffff9 (RELACOUNT) 811 2025-05-07T20:03:55.3959198Z 0x0000000000000000 (NULL) 0x0 2025-05-07T20:03:55.3959405Z 2025-05-07T20:03:55.3959515Z ################################################################################ 2025-05-07T20:03:55.3959746Z 2025-05-07T20:03:55.3959750Z 2025-05-07T20:03:55.3959962Z [CHECK] Verifying sample subset of symbols in the built libraries ... 2025-05-07T20:03:55.4022586Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:55.4049028Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:55.4109752Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:55.4141539Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:55.4381194Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:55.4415987Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:55.4453058Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:55.4480615Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::asynchronous_inclusive_cumsum_cpu 2025-05-07T20:03:55.4596376Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/asmjit.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.4621603Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_config.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.4674837Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_cache.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.4717246Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_optimizers.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.4948502Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.4984564Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_common.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.5019589Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_utils.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.5043026Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_sparse_async_cumsum.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.5452758Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_pt2.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.5821934Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_inference.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.6734975Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_forward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.6954606Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward_split_host.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.7037030Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_index_select.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.7069565Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_embedding_inplace_ops.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.8940645Z [CHECK] Symbol NOT found in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_tbe_training_backward.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.9269965Z [CHECK] Found symbol in ./_skbuild/linux-x86_64-3.10/cmake-build/fbgemm_gpu_py.so: fbgemm_gpu::jagged_2d_to_dense 2025-05-07T20:03:55.9271374Z ################################################################################ 2025-05-07T20:03:55.9271893Z [BUILD] Wheel Audit: dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:03:55.9272313Z 2025-05-07T20:03:55.9272918Z + conda run --no-capture-output -n build_binary auditwheel show dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:03:55.9273493Z 2025-05-07T20:04:04.3292987Z 2025-05-07T20:04:04.3293493Z fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl is 2025-05-07T20:04:04.3294068Z consistent with the following platform tag: "linux_x86_64". 2025-05-07T20:04:04.3294564Z 2025-05-07T20:04:04.3294756Z The wheel references external versioned symbols in these 2025-05-07T20:04:04.3295207Z system-provided shared libraries: librt.so.1 with versions 2025-05-07T20:04:04.3295667Z {'GLIBC_2.2.5'}, libgcc_s.so.1 with versions {'GCC_12.0.0', 2025-05-07T20:04:04.3296076Z 'GCC_3.0'}, libstdc++.so.6 with versions {'CXXABI_1.3.7', 2025-05-07T20:04:04.3296530Z 'GLIBCXX_3.4.9', 'CXXABI_1.3.11', 'GLIBCXX_3.4.20', 'GLIBCXX_3.4.18', 2025-05-07T20:04:04.3297004Z 'GLIBCXX_3.4.21', 'GLIBCXX_3.4', 'CXXABI_1.3.5', 'GLIBCXX_3.4.19', 2025-05-07T20:04:04.3297469Z 'CXXABI_1.3.3', 'GLIBCXX_3.4.11', 'GLIBCXX_3.4.14', 'CXXABI_1.3', 2025-05-07T20:04:04.3297982Z 'GLIBCXX_3.4.15'}, libc.so.6 with versions {'GLIBC_2.17', 2025-05-07T20:04:04.3298410Z 'GLIBC_2.2.5', 'GLIBC_2.3.3', 'GLIBC_2.3.2', 'GLIBC_2.3', 'GLIBC_2.6', 2025-05-07T20:04:04.3298868Z 'GLIBC_2.14'}, libpthread.so.0 with versions {'GLIBC_2.3.4', 2025-05-07T20:04:04.3299347Z 'GLIBC_2.2.5'}, libm.so.6 with versions {'GLIBC_2.2.5'}, 2025-05-07T20:04:04.3299823Z libcudart.so.11.0 with versions {'libcudart.so.11.0'}, libgomp.so.1 2025-05-07T20:04:04.3300318Z with versions {'OMP_1.0'}, libdl.so.2 with versions {'GLIBC_2.2.5'} 2025-05-07T20:04:04.3300637Z 2025-05-07T20:04:04.3300921Z This constrains the platform tag to "manylinux_2_35_x86_64". In order 2025-05-07T20:04:04.3301451Z to achieve a more compatible tag, you would need to recompile a new 2025-05-07T20:04:04.3301932Z wheel from source on a system with earlier versions of these 2025-05-07T20:04:04.3302362Z libraries, such as a recent manylinux image. 2025-05-07T20:04:04.4221293Z 2025-05-07T20:04:04.4221341Z 2025-05-07T20:04:04.4222121Z ################################################################################ 2025-05-07T20:04:04.4222923Z [BUILD] Enumerating the built wheels ... 2025-05-07T20:04:04.4223437Z + ls -lth dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:04:04.4223821Z 2025-05-07T20:04:04.4238255Z -rw-r--r--. 1 root root 262M May 7 20:03 dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:04:04.4239597Z 2025-05-07T20:04:04.4239916Z [BUILD] Enumerating the wheel SHAs ... 2025-05-07T20:04:04.4241275Z + sha1sum dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:04:04.4242344Z 2025-05-07T20:04:04.9185815Z 228315948ebdbf182bae815c2dc49071a79d3d6e dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:04:04.9187486Z 2025-05-07T20:04:04.9188263Z + sha256sum dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:04:04.9189399Z 2025-05-07T20:04:06.0692950Z c74f369e1942a4cfe1d5aa37aff21862288b95f2c528fb2f81c72285d13f8c7f dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:04:06.0694929Z 2025-05-07T20:04:06.0695701Z + md5sum dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:04:06.0696793Z 2025-05-07T20:04:06.5111610Z eaee03a3dfa8ef7191fb373b8d9f1464 dist/fbgemm_gpu_nightly-2025.5.7-cp310-cp310-manylinux_2_28_x86_64.whl 2025-05-07T20:04:06.5112332Z 2025-05-07T20:04:06.5112598Z [BUILD] FBGEMM-GPU build + package completed 2025-05-07T20:04:06.5227576Z ##[group]Run actions/upload-artifact@v4 2025-05-07T20:04:06.5227926Z with: 2025-05-07T20:04:06.5228231Z name: fbgemm_default_x86_clang_py3.10_cu11.8.0.whl 2025-05-07T20:04:06.5228585Z path: fbgemm_gpu/dist/*.whl 2025-05-07T20:04:06.5228931Z if-no-files-found: error 2025-05-07T20:04:06.5229201Z compression-level: 6 2025-05-07T20:04:06.5229479Z overwrite: false 2025-05-07T20:04:06.5229757Z include-hidden-files: false 2025-05-07T20:04:06.5230049Z env: 2025-05-07T20:04:06.5230313Z PRELUDE: .github/scripts/setup_env.bash 2025-05-07T20:04:06.5230638Z BUILD_ENV: build_binary 2025-05-07T20:04:06.5230925Z BUILD_TARGET: default 2025-05-07T20:04:06.5231344Z BUILD_VARIANT: cuda 2025-05-07T20:04:06.5231799Z BUILD_CUDA_VERSION: 11.8.0 2025-05-07T20:04:06.5232083Z ##[endgroup] 2025-05-07T20:04:06.5236210Z ##[command]/usr/bin/docker exec 9d6bffcb154a3aafcf2591fe5a342c22735765e9fd7602d6273e9fda2f8bbbaa sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:04:06.9848163Z With the provided path, there will be 1 file uploaded 2025-05-07T20:04:06.9849508Z Artifact name is valid! 2025-05-07T20:04:06.9850485Z Root directory input is valid! 2025-05-07T20:04:07.0673093Z Beginning upload of artifact content to blob storage 2025-05-07T20:04:07.7499905Z Uploaded bytes 8388608 2025-05-07T20:04:08.1839258Z Uploaded bytes 16777216 2025-05-07T20:04:08.5701654Z Uploaded bytes 25165824 2025-05-07T20:04:09.0041752Z Uploaded bytes 33554432 2025-05-07T20:04:09.2612687Z Uploaded bytes 41943040 2025-05-07T20:04:09.6042509Z Uploaded bytes 50331648 2025-05-07T20:04:09.8906264Z Uploaded bytes 58720256 2025-05-07T20:04:10.1395012Z Uploaded bytes 67108864 2025-05-07T20:04:10.4271194Z Uploaded bytes 75497472 2025-05-07T20:04:10.7594492Z Uploaded bytes 83886080 2025-05-07T20:04:10.9977934Z Uploaded bytes 92274688 2025-05-07T20:04:11.3510252Z Uploaded bytes 100663296 2025-05-07T20:04:11.5753039Z Uploaded bytes 109051904 2025-05-07T20:04:11.9543707Z Uploaded bytes 117440512 2025-05-07T20:04:12.3194215Z Uploaded bytes 125829120 2025-05-07T20:04:12.7145891Z Uploaded bytes 134217728 2025-05-07T20:04:13.2753857Z Uploaded bytes 142606336 2025-05-07T20:04:13.6031723Z Uploaded bytes 150994944 2025-05-07T20:04:14.1275465Z Uploaded bytes 159383552 2025-05-07T20:04:14.6144105Z Uploaded bytes 167772160 2025-05-07T20:04:14.9834686Z Uploaded bytes 176160768 2025-05-07T20:04:15.4781611Z Uploaded bytes 184549376 2025-05-07T20:04:15.9201897Z Uploaded bytes 192937984 2025-05-07T20:04:16.3960059Z Uploaded bytes 201326592 2025-05-07T20:04:16.9927302Z Uploaded bytes 209715200 2025-05-07T20:04:17.3133713Z Uploaded bytes 218103808 2025-05-07T20:04:17.7973093Z Uploaded bytes 226492416 2025-05-07T20:04:18.2774022Z Uploaded bytes 234881024 2025-05-07T20:04:18.6994582Z Uploaded bytes 243269632 2025-05-07T20:04:19.0962903Z Uploaded bytes 251658240 2025-05-07T20:04:19.5515146Z Uploaded bytes 260046848 2025-05-07T20:04:19.9765510Z Uploaded bytes 268051646 2025-05-07T20:04:19.9927208Z Finished uploading artifact content to blob storage! 2025-05-07T20:04:19.9927951Z SHA256 digest of uploaded artifact zip is a98749bacbee05a7467c7ff2d26346ca5ccbfbab68b8b6ac9db410be8560ec5d 2025-05-07T20:04:19.9928619Z Finalizing artifact upload 2025-05-07T20:04:20.0942049Z Artifact fbgemm_default_x86_clang_py3.10_cu11.8.0.whl.zip successfully finalized. Artifact ID 3081410083 2025-05-07T20:04:20.0944738Z Artifact fbgemm_default_x86_clang_py3.10_cu11.8.0.whl has been successfully uploaded! Final size is 268051646 bytes. Artifact ID is 3081410083 2025-05-07T20:04:20.0954193Z Artifact download URL: https://github.com/pytorch/FBGEMM/actions/runs/14891846252/artifacts/3081410083 2025-05-07T20:04:20.1231063Z Post job cleanup. 2025-05-07T20:04:20.1237194Z ##[command]/usr/bin/docker exec 9d6bffcb154a3aafcf2591fe5a342c22735765e9fd7602d6273e9fda2f8bbbaa sh -c "cat /etc/*release | grep ^ID" 2025-05-07T20:04:20.3996237Z [command]/usr/bin/git version 2025-05-07T20:04:20.4033475Z git version 2.47.1 2025-05-07T20:04:20.4066892Z Copying '/github/home/.gitconfig' to '/__w/_temp/cd49e99a-813b-4870-8a6a-65761863692e/.gitconfig' 2025-05-07T20:04:20.4080571Z Temporarily overriding HOME='/__w/_temp/cd49e99a-813b-4870-8a6a-65761863692e' before making global git config changes 2025-05-07T20:04:20.4082941Z Adding repository directory to the temporary git global config as a safe directory 2025-05-07T20:04:20.4084867Z [command]/usr/bin/git config --global --add safe.directory /__w/FBGEMM/FBGEMM 2025-05-07T20:04:20.4125763Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-05-07T20:04:20.4160969Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-05-07T20:04:20.4434645Z Entering 'external/asmjit' 2025-05-07T20:04:20.4480696Z Entering 'external/composable_kernel' 2025-05-07T20:04:20.4557321Z Entering 'external/cpuinfo' 2025-05-07T20:04:20.4611749Z Entering 'external/cutlass' 2025-05-07T20:04:20.4674188Z Entering 'external/googletest' 2025-05-07T20:04:20.4733826Z Entering 'external/hipify_torch' 2025-05-07T20:04:20.4792398Z Entering 'external/json' 2025-05-07T20:04:20.4858593Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-05-07T20:04:20.4875227Z http.https://github.com/.extraheader 2025-05-07T20:04:20.4882649Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2025-05-07T20:04:20.4909349Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-05-07T20:04:20.5172732Z Entering 'external/asmjit' 2025-05-07T20:04:20.5203217Z http.https://github.com/.extraheader 2025-05-07T20:04:20.5233641Z Entering 'external/composable_kernel' 2025-05-07T20:04:20.5265948Z http.https://github.com/.extraheader 2025-05-07T20:04:20.5301051Z Entering 'external/cpuinfo' 2025-05-07T20:04:20.5345014Z http.https://github.com/.extraheader 2025-05-07T20:04:20.5382848Z Entering 'external/cutlass' 2025-05-07T20:04:20.5414864Z http.https://github.com/.extraheader 2025-05-07T20:04:20.5471332Z Entering 'external/googletest' 2025-05-07T20:04:20.5504232Z http.https://github.com/.extraheader 2025-05-07T20:04:20.5540689Z Entering 'external/hipify_torch' 2025-05-07T20:04:20.5587132Z http.https://github.com/.extraheader 2025-05-07T20:04:20.5628351Z Entering 'external/json' 2025-05-07T20:04:20.5661478Z http.https://github.com/.extraheader 2025-05-07T20:04:20.5829780Z Stop and remove container: e02b70d7a92a414ba22decd9f7117672_amazonlinux2023_04e192 2025-05-07T20:04:20.5835233Z ##[command]/usr/bin/docker rm --force 9d6bffcb154a3aafcf2591fe5a342c22735765e9fd7602d6273e9fda2f8bbbaa 2025-05-07T20:04:21.3716443Z 9d6bffcb154a3aafcf2591fe5a342c22735765e9fd7602d6273e9fda2f8bbbaa 2025-05-07T20:04:21.3746789Z Remove container network: github_network_23c588a7f7ea4423950955615706ca02 2025-05-07T20:04:21.3751611Z ##[command]/usr/bin/docker network rm github_network_23c588a7f7ea4423950955615706ca02 2025-05-07T20:04:22.4979896Z github_network_23c588a7f7ea4423950955615706ca02 2025-05-07T20:04:22.5020973Z A job completed hook has been configured by the self-hosted runner administrator 2025-05-07T20:04:22.5040730Z ##[group]Run '/home/ec2-user/runner-scripts/after_job.sh' 2025-05-07T20:04:22.5046651Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-05-07T20:04:22.5047080Z ##[endgroup] 2025-05-07T20:04:22.5156378Z [!ALERT!] Swap in detected! [!ALERT!] 2025-05-07T20:04:32.7108666Z [!ALERT!] Swap out detected [!ALERT!] 2025-05-07T20:04:48.7332477Z Cleaning up orphan processes